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5 NOVEL PROSTATE-SPECIFIC OR TESTTS-SPECinC 

NUCLEIC ACID MOLECULES, POLYPEPTIDES, AlSnP DIAGNOSTIC 
AND TEIERAPEUTIC METHODS 

Field of The Invention 
1 0 The invention generally relates to the treatment of disorders 

associated with prostate and testis dysfunction and ceU proliferation, and 
specifically relates to the identification and use of novel genes for diagnosis 
and treatment of such disorders. 

15 Background of The Invention 

Genitourinary disorders are often difficult to diagnose and treat 
effectively because they are present non-specifically. Two causes of 
genitourinary disorders are disorders of the prostate gland and the testis. 

The prostate is a variable sized gland located in the male pelvis, and 

20 is made up of several different cell types, including epithelial cells and 

stromal cells. Prostate-associated disorders include prostate cancer, benign 
prostatic hyperplasia, and prostatitis. The male hormone testosterone and 
other androgen related hormones have major roles in the growth and 
fimction of the prostate. The testis is also subject to many defects, 

25 including developmental anomalies, inflammation, and cancer. 

In men, prostate cancer is the most commonly diagnosed cancer and 
the second leading cause of cancer mortality following skin cancer. In the 
initial stages, prostate cancer is dependent on androgens for growth, and 
this dependence is the basis for androgen ablation therapy. In most cases, 

30 however, prostate cancer progresses to an androgen-independent phenotype 
for which there is no effective therapy available at present 
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Currently, there is limited information regarding the molecular 
details of prostate cancer progression. Several independent ^proaches 
resulted in the identification of a few highly prostate-enriched genes that 
may have unique roles in this process. The first such gene discovered was 
5 Prostate Specific Antigen (PSA), the detection of which is currently used as 
a diagnostic tool and also as a marker for the progression of prostate cancer, 
albeit with significant limitations. More recently, several additional 
prostate-enriched genes were identified including prostate-specific 
membrane antigen (PSMA), prostate carcinoma tumor antigen 1 (PCTA-1), 
10 NKX3.1, prostate stem cell antigen (PSCA), DD3, and PCGEMl. 

It would be beneficial to provide reagents useful for the diagnosis 
and therapy of disorders associated with the prostate and the testis, as well 
as other tissues. 

15 Summary of flie Invention 

The invention provides, in general, a novel prostate-specific or 
testis-specific nucleic acid molecules, polypeptides, antibodies, and 
modulatory compounds for use in methods of diagnosing, treating, and 
preventing diseases and conditions of the prostate and testis, such as cancer. 

20 In a first aspect the invention provides a substantially pure prostate- 

specific or testis-specific polypeptide, including a sequence substantially 
identical to the sequence of any of SEQ ID NOS: 14, 29, 32, 34, 36, 41, or 
53. In a preferred embodiment of the first aspect, the substantially pure 
prostate-specific or testis-specific polypeptide includes the sequence of any 

25 of SEQ ID NOS: 14, 29, 32, 34, 36, 41, or 53. In another preferred 
embodiment, the invention provides an isolated nucleic acid molecule 
encoding a polypeptide of the first aspect, for example a nucleic acid 
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molecule including the sequence of any of SEQ ID NOS: 23, 28, 31, 33, 35, 
40, or 52. Preferably, the polypeptide is derived from a mammal, e.g., a 
human. 

In a second aspect, the invention provides an isolated prostate- 
5 specific or testis-specific nucleic acid molecule including a sequence 
substantially identical to SEQ ID NOS: 1-12, 22, 27, 30, and 51. 

In a third aspect, the invention provides an isolated prostate-specific 
or testis-specific nucleic acid molecule consisting essentially of SEQ ID 
NOS: 15-21, 24-26, 42-50, and 54-70. 
10 In preferred embodiments of some of the above aspects, the 

invention provides a vector, a cell, a cell including the vector, and a non- 
human transgenic animal including the isolated nucleic acid molecules. 

In a fourth aspect, the invention provides an isolated nucleic acid 
molecule that hybridizes under high stringency conditions to the 
15 complement of any of the sequences set forth in SEQ ID NOS: 1-12, 15-28, 
30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, where the isolated nucleic acid 
molecule encodes a prostate-specific or testis-specific polypeptide. 

In a fifth aspect, the invention provides an isolated nucleic acid 
molecule, where the nucleic acid molecule includes a sequence that is 
20 antisense to the coding strand of any of the prostate-specific or testis- 
specific nucleic acid molecules set forth in SEQ ID NOS: 1-12, 15-28, 30, 
31, 33, 35, 40, 42-50, 51, 52, or 54-70, or a firagment thereof 

In a sixth aspect, the iuvention provides a probe for analyzing a 
prostate-specific or testis-specific gene or homolog or fragment thereof, the 
25 probe having greater than 55% nucleotide sequence identity to a sequence 
encoding any of SEQ ID NOS: 1-12, 15-28, 30, 31, 33, 35, 40, 42-50, 51, 
52, or 54-70, or fragment thereof, where the fragment includes at least six 
amino acids, and the probe hybridizes under high stringency conditions to 
at least a portion of a prostate-specific or testis-specific nucleic acid 
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molecule. In a preferred embodiment of this aspect, the probe has 1 00% 
complementarity to a nucleic acid molecule encoding any of SEQ ID NOS: 
1-12, 15-28, 30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, or fragment thereof, 
where the fragment comprises at least six amino acids, and said probe 
5 hybridizes under high stringency conditions to at least a portion of a 
prostate-specific or testis-specific nucleic acid molecule. 

In a seventh aspect, the invention provides an antibody that 
specifically binds to a prostate-specific or testis-specific polypeptide that 
includes an amino acid sequence that is substantially identical to the amino 

10 acid sequence of any of SEQ ID NOS: 14, 29, 32, 34, 36, 41, or 53. 

In an eighth aspect, the invention provides a method of detecting a 
prostate-specific or testis-specific gene or fragment thereof in a cell, the 
method including contacting the nucleic acid molecule of any of SEQ ID 
NOS: 1-12, 15-28, 30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, or afragment 

15 thereof, where the fragnient is greater than about 18 nucleotides in length, 
with a preparation of genoiiaic DNA from the cell, imder high stringency 
hybridization conditions, and detecting DNA sequences having about 55% 
or greater nucleotide sequence identity to any of SEQ ID NOS: 1-12, 15-28, 
30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, thus identifying a prostate- 

20 specific or testis-specific gene or fragment thereof Nucleotides encoding 
the polypeptides of SEQ ID NOS: 38, 39, or 71-73 can also be used in an 
embodiment of this aspect. In a preferred embodiment of this aspect, the 
method includes detecting a neoplastic or cancer cell in a patient 
predisposed to or at risk for cancer, for example, for prostate cancer. 

25 In a ninth aspect, the invention provides a method for identifying a 

test compoimd that modulates the expression or activity of a prostate- 
specific or testis-specific polypeptide, the method includiag contacting the 
prostate-specific or testis-specific polypeptide with the test compound, and 
determining the effect of the test compound on the prostate-specific or 
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testis-specific polypeptide expression or activity. In a preferred 
embodiments of this aspect, the prostate-specific or testis-specific 
polypeptide includes an amino acid sequence substantially identical to the 
amino acid sequence of SEQ ID NOS: 14, 29, 32, 34, 36, 38, 39, 41, 53, or 
5 71-73, and firagments and analogs thereof 

In a tenth aspect, the invention provides a method of treating a 
mammal having a disorder of the prostate or testis, the method including 
administering to the mammal a therapeutically effective amount of a 
compound that modulates the activity or expression of a prostate-specific or 

10 testis-specific polypeptide, where the compound has a beneficial effect on 
the disorder in the manunal. In preferred embodiments of this aspect, the 
disorder is prostate cancer, the mammal is a human, or the prostate-specific 
or testis-specific polypeptide includes an amino acid sequence substantially 
identical to the amino acid sequence of SEQ ID NOS: 14, 29, 32, 34, 36, 

15 38, 39, 41 j 53, or 71-73, and firagments and analogs thereof. ' . ^ . 

In an eleventh aspect, the invention provides a pharmaceutical 
composition including at least one dose of a therapeutically effective 
amount of a prostate-specific or testis-specific polypeptide or fira^ent 
thereof, in a pharmaceutically acceptable carrier, the composition being 

20 formulated for the treatment of a disorder of the prostate or testis. 

In a twelfth aspect, the invention provides a kit for the analysis of a 
prostate-specific or testis-specific nucleic acid molecule, the kit including a 
nucleic acid molecule probe for analyzing a prostate-specific or testis- 
specific nucleic acid molecule present in a test subject 

25 In a thirteenth aspect, the invention provides a kit for the analysis of 

a prostate-specific or testis-specific polypeptide, the kit including an 
antibody for analyzing a prostate-specific or testis-specific polypeptide 
present in a test subject. 



5 



9 



WO 01/72962 PCT/USOl/09410 



As used herein, by "polypeptide," '"protein," or "polypeptide 
fragment is meant a chain of two or more amino acids, regardless of any 
post-translational modification (e,g., glycosylation or phosphorylation), 
constituting all or part of a naturally or non-naturally occurring 

5 polypeptide. By "post-translational modification" is meant any change to a 
polypeptide or polypeptide fragment during or after synthesis. Post- 
translational modifications can be produced naturally (such as during 
synthesis within a cell) or generated artificially (such as by recombinant or 
chemical means). A protein can be made up of one or more polypeptides. 

1 0 By "substantially pure polypeptide" or "substantially pure and 

isolated polypeptide" is meant a polypeptide (or a fragment thereof) that 
has been separated from components that naturally accompany it 
Typically, the polypeptide is substantially pure when it is at least 60%, by 
weight, free from the proteins and naturally occurring organic molecules 

15 with which it is naturally associated. Preferably, the polypeptide is a 

prostate-specific or a testis-specific polypeptide that is at least 75%, more 
preferably at least 90%, and most preferably at least 99%, by weight, pure. 
A substantially pure prostate-specific or a testis-specific polypeptide may 
be obtained by standard techniques, for example, by extraction from a 

20 natural source (e.g., prostate or testis tissue or cell lines), by expression of a 
recombitiant nucleic acid encoding a prostate-specific or a testis-specific 
polypeptide, or by chemically synthesizing the polypeptide. Purity can be 
measured by any appropriate method, e.g., by colunm chromatography, 
polyacrylamide gel electrophoresis, or HPLC analysis. 

25 A protein or polypeptide is substantially free of naturally associated 

components when it is separated from those contatninants that acconq)any 
it in its natural state. Thus, a protein that is chemically synthesized or 
produced in a cellular system different from the cell from which it naturally 
originates will be substantially free from its naturally associated 
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components. Accordingly, substantially pure polypeptides not only include 
those derived from eukaryotic organisms but also those synthesized in E. 
coli or other prokaryotes. 

The term "identity" is used herein to describe the relationship of the 
5 sequence of a particular nucleic acid molecule or polypeptide to the 
sequence of a reference molecule of the same type. For example, if a 
polypeptide or nucleic acid molecule has the same amino acid or nucleotide 
residue at a given position, compared to a reference molecule to which it is 
aligned, there is said to be "identity" at that position. 

10 The level of sequence identity of a nucleic acid molecule or a 

polypeptide to a reference molecule is typically measured using sequence 
analysis software with the default parameters specified therein, such as the 
introduction of gaps to achieve an optimal aligmnent. The "identity" of 
two or more nucleic acid or polypeptide sequences can therefore be readily 

1 5 calctilated by known methods, including but not limited to those described 
in Computational Molecular Biology, Lesk, A.M., ed., Oxford University 
Press, :New York, 1988; Biocomputing: Informatics and Genome Projects, 
Smith, D.W., ed., Academic Press, New York, 1993; Computer Analysis of 
Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., Humana 

20 Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von 
Heinje, Academic Press, 1987; and Sequence Analysis Primer, Gribskov, 
and Deverexix, eds., M, Stockton Press, New York, 1991; and Carillo and 
Lipman, SIAM J. AppKed Math. 48:1073, 1988. 

Methods to determiae identity are available in publicly available 

25 computer programs. Computer program methods to determine identity 
between two sequences include, but are not limited to, the GCG program 
package (Devereux et al.. Nucleic Acids Research 12(1): 387, 1984), 
BLASTP, BLASTN, and FASTA (Altschul et al., J. Mol. Biol. 215.- 403 
(1990). The well known Smith Waterman algorithm may also be used to 
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determine identity. The BLAST program is publicly available from NCBI 
and other sources {BLAST Manual, Altschul, et al., NCBI NLM NIH 
Bethesda, MD 20894). Searches can be performed in URLs such as the 
following http://ww.ncbi.nhn.nih.gov/BLAST/unfmishedgenome.html: or 
5 http://www.tigr.o^g/cgi-bin/BlastSearch^last.cgi. These software programs 
match similar sequences by assigning degrees of homology to various 
substitutions, deletions, and other modifications. Conservative substitutions 
typically include substitutions within the following groups: glycine, alanine; 
valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, 

10 glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. 

A nucleic acid molecule or polypeptide is said to be "substantially 
identical" to a reference molecule if it exhibits, over its entire length, at 
least 50%, 60%, or 70%, preferably at least 80% or 90%, more preferably at 
least 95%, and most preferably at least 99% identity to the sequence of the 
v: 15 reference molecule. For polypeptides, the length of comparison sequences 
is at least 16 amino acids, preferably at least 20 amino acids or at least 
25 amino acids, more preferably at least 35 amino acids, and most 
preferably, the full-length polypeptide. For nucleic acid molecules, the 
length of comparison sequences is at least 50 nucleotides, preferably at 

20 least 60 nucleotides, more preferably at least 75 nucleotides or at least 

1 10 nucleotides, and most preferably, the full-length nucleic acid molecule. 
Alternatively, or additionally, two nucleic acid sequences are "substantially 
identical" if they hybridize under high stringency conditions. 

By "isolated nucleic acid molecule," "substantially pure nucleic acid 

25 molecule," or "substantially pure and isolated nucleic acid molecule" is 
meant a nucleic acid molecule (for example, DNA) that is free of the genes 
that, in the naturally occurring genome of the organism from which the 
nucleic acid molecule of the invention is derived, flank the nucleic acid. 
The term includes, for example, a recombinant DNA that is incorporated 
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into a vector; into an autonomously replicating plasmid or virus; or into the 
genomic DNA of a prokaryote or eukaryote; or that exists as a separate 
molecule (e.g., a cDNA or a genomic or cDNA fragment produced by PGR 
or restriction endonuclease digestion) independent of other sequences. It 
5 also includes a recombinant DNA that is part of a hybrid gene encoding 
additional polypeptide sequence. 

By "antisense," as used herein in reference to nucleic acid 
molecules, is meant a molecule having a nucleic acid sequence, regardless 
of length, that is complementary to at least 75 nucleotides, and preferably at 

10 least 100, 150, or 200 nucleotides, of the coding strand of a nucleic acid 
molecule encoding a prostate-specific or a testis-specific polypeptide, as 
described herein. An antisense molecule may also include regulatory 
sequences such as transcription enhancers, hormone responsive elements, 
ribosomal- and RNA polymerase binding sites, etc., which may be located 

15 upstream or downstream of the coding region, and may have a distance of 
several ten base pairs to several ten thoxisand base pairs. An antisense 
nucleic acid molecule can be, for example, capable of preferentially 
lowering the production or expression of a prostate-specific or a testis- 
specific polypeptide encoded by a prostate-specific or a testis-specific 

20 nucleic acid molecule. 

By "prostate-specific" or "testis-specific" nucleic acid molecule is 
meant a nucleic acid molecule, such as a genomic DNA, cDNA, or RNA 
(e.g., mRNA) molecule, having at least 50, 60, or 75%, more preferably at 
least 80, 85, or 95%, and most preferably at least 99% amino acid identity 

25 to the nucleic acid molecules described herein, for example, in Figures 4, 
11, and 14. In addition, a nucleic acid molecule having at least 50, 60, or 
75%, more preferably at least 80, 85, or 95%, and most preferably at least 
99% nucleotide identity to a nucleotide sequence encoding amino acids 1- 
200 of STMPl (SEQ ID NO: 14), preferably encoding amino acids 40-150 
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of STMPl, can be considered a prostate-specific or testis-specific nucleic 
acid molecule. Specifically excluded from this definition is STEAP 
(AF186249) (Hubert, R. S. et al., Proc Natl Acad Sci USA 96, 14523- 
14528, 1999) and nucleic acid molecule sequences set forth in or encoding 
5 ESTs AF132025, AP177862, BAB23615, BAA91839, BAB15559, and 
NP_032190. 

A preferred prostate-specific nucleic acid molecule may be 
preferentially expressed in prostate tissue at a level that is at least 5-fold 
higher, preferably at least 10-fold Mgher, more preferably at least 15-fold 

10 . higher, and most preferably at least 20-fold higher than the level of the 

same nucleic acid molecule in at least one non-prostate tissue, preferably in 
all other non-prostate tissues. A prostate-specific nucleic acid molecule can 
also be expressed at high levels in a non-prostate tissue although, generally, 
the level of expression will be the highest in the prostate. Occasionally, as 

1 5 described herein, a prostate-specific nucleic acid molecule wUl be 

expressed at higher levels in non-prostate tissue (e.g., placenta, lung, or 
liver) than in the prostate. 

A preferred testis-specific nucleic acid molecule may be 
preferentially expressed in testis tissue at a level that is at least 5-fold 

20 higher, preferably at least 10-fold higher, more preferably at least 15-fold 
higher, and most preferably at least 20-fold higher than the level of the 
same nucleic acid molecule in at least one non- testis tissue, preferably in 
all other non- testis tissues. A testis -specific nucleic acid molecule can 
also be expressed at high levels in a non- testis tissue although, generally, 

25 the level of expression will be the highest in the testis. Occasionally, as 
described herein, a testis -specific nucleic acid molecule will be expressed 
at higher levels in non- testis tissue (e.g., placenta, lung, or liver) than in the 
testis. 
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By "prostate-specific" or a "testis-specific" polypeptide or "prostate- 
specific" or a "testis-specific" protein is meant a polypeptide that is 
encoded by a prostate-specific or a testis-specific nucleic acid molecule. A 
prostate-specific or testis-specific polypeptide may also be defined as a 
5 polypeptide having at least 50, 60, or 75%, more preferably at least 80, 85, 
or 95%, and most preferably at least 99% amino acid identity to the 
polypeptides described herein, for example, in Figures 4, 11, and 14. 
Specifically excluded firom this definition is STEAP (AFl 86249) (Hubert, 
R. S. et Bl,,Proc Natl Acad Sci USA 96, 14523-14528, 1999) and 

10 polypeptide sequences set forth in or encoded by ESTs AF132025, 
AF177862, BAB23615, BAA91839, BAB15559, and]S[P_032190. In 
addition, a polypeptide having at least 50, 60, or 75%, more preferably at 
least 80, 85, or 95%, and most preferably at least 99% amino acid identity 
to amino acids 1-200 of STMPl (SEQ JD NO: 14), preferably amino acids 

1 5 40- 1 50 of STMP 1 , can be considered a prostate-specific or testis-specific 
polypeptide. 

A preferred prostate-specific polypeptide is preferentially expressed 
in prostate tissue at a level that is at least 5-fold higher, preferably at least 
10-fold higher, more preferably at least 15-fold higher, and most preferably 

20 at least 20-fold higher than the level of the same polypeptide in at least one 
non-prostate tissue, preferably in all other non-prostate tissues. A prostate- 
specific polypeptide can also be expressed at high levels in a non-prostate 
tissue although, generally, the level of expression will be the highest in the 
prostate. Occasionally, as described herein, a prostate-specific polypeptide 

25 will be e>q)ressed at higher levels in non-prostate (e.g., placenta, lung, Uver) 
than in the prostate, 

A preferred testis-specific polypeptide is preferentially expressed in 
testis tissue at a level that is at least 5-fold higher, preferably at least 10- 
fold higher, more preferably at least 15-fold higher, and most preferably at 
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least 20-fold higher than the level of the same polypeptide in at least one 
non- testis tissue, preferably in all other non-testis tissues, A testis-specific 
polypeptide can also be e?q)ressed at high levels in a non- testis tissue 
although, generally, the level of expression will be the highest in the testis. 
5 Occasionally, as described herein, a testis-specific polypeptide will be 
expressed at higher levels in non- testis (e.g., plac^ta, lung, liver) than in 
. the testis. 

The term prostate-specific or testis-specific polypeptide includes 
homologs, analogs, firagments, and isoforms, e.g., alternatively spliced 

10 isoforms, of the sequences described herein. By ^^biologically active 

fragment" is meant a polypeptide fragment of a prostate-specific or testis- 
specific polypeptide that exhibits, for example, extracellular trafficking, cell 
signaling, or other properties that are at least 30%, preferably at least 50%, 
more preferably at least 75%, and most preferably at least 100%, compared 

15 with the properties of a fiill length prostate-specific or testis-specific 

polypeptide. By "analog" is meant any substitution, addition, or deletion in 
the amino acid sequence of a prostate-specific or testis-specific polypeptide 
that exhibits properties that are at least 30%, preferably at least 50%, more 
preferably at least 75%, and most preferably at least 100%, compared with 

20 the extracellular trafficking or cell signaling properties of the polypeptide 
from which it is derived. Fragments, homologs, and analogs can be 
generated using standard techniques, for example, solid phase peptide 
synthesis or polymerase chain reaction. For example, point mutations may 
arise at any position of the sequence from an apurinic, apyrimidinic, or 

25 otherwise structurally impaired site within the cDNA. Alternatively, point 
mutations may be introduced by random or site-directed mutagenesis 
procedures {e,g,, oligonucleotide assisted or by error prone PC31). 
Likewise, deletions and/or insertions may be introduced into the sequences, 
and preferred insertions comprise 5'- and/or 3'-fusions with a 
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polynucleotide that encodes a reporter moiety or an affinity moiety. Other 
preferred insertions comprise a nucleic acid that further includes functional 
elements such as a promoter, enhancer, hormone responsive element, origin 
of replication, transcription and translation initiation sites, etc. It should be 
5 appreciated that where insertions with one or more functional elements are 
present, the resulting nucleic acid may be linear or circular (e.g., 
transcription or expression cassettes, plasmids, etc;). 

For use in the methods of the invention, the terms "prostate-specific" 
or "testis-specific" polypeptide further include the polypeptide sequences 

10 set forth in or encoded by ESTs AF132025, AF177862, BAB23615, 
BAA91839, BAB15559, and NP_032190, but does not include STEAP, 
and a prostate-specific or testis-specific nucleic acid molecule includes the 
nucleotide sequences set forth in or encoding ESTs AF132025, AF177862, 
BAB23615, BAA91839, BAB15559, andNP_032190, but does not include 

15 STEAP. , 

By *^rostate-specific or a testis-specific gene or homolog or 
firagment thereof is meant a gene, or homolog of a gene, that encodes a 
prostate-specific or testis-specific polypeptide. 

By "specifically binds" is meant a compound, e.g., an antibody, that 

20 recognizes and binds a protein or polypeptide, for example, a prostate- 
specific or a testis-specific polypq)tide, and that when detectably labeled 
can be competed away for binding to that protein or polypeptide by an 
excess of compound that is not detectably labeled. A compound that non- 
specifically binds is not competed away by excess detectably labeled 

25 compound. A preferred antibody binds to any prostate-specific or a testis- 
specific polypeptide sequence that is substantially identical to any of the 
polypeptide sequences set forth in Figures 4, 1 1, and 14, or encoded by any 
of the nucleotide sequences set forth in Figures 3, 4, 1 1, and 14, or portions 
thereof 
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By a "compoxmd," *test compoxmd/' or "candidate compound" is 
meant a molecule, be it naturally-occurring or artificially-derived, and 
includes, for example, peptides, proteins, synthetic organic molecules, 
naturally-occurring organic molecules, nucleic acid molecules, and 
5 components thereof. 

By "high stringency conditions" is meant conditions that allow 
hybridization comparable with the hybridization that occurs using a DNA 
probe of at least 500 nucleotides in length, in a buffer containing 0.5 M 
NaHP04, pH 7.2, 7% SDS, 1 mM EDTA, and 1% BSA (fraction V), at a 

10 temperature of 65°C, or a buffer containing 48% formamide, 4.8X SSC, 0.2 
M Tris-Cl, pH 7.6, IX Denhardt's solution, 10% dextran sulfate, and 0.1% 
SDS, at a temperature of 42°C (these are typical conditions for high 
stringency Northern or Southern hybridizations). High stringency 
hybridization is also relied upon for the success of numerous techniques 

1 5 routinely performed by molecular biologists, such as high stringency PGR, 
DNA sequencing, single strand conformational polymorphism analysis, and 
in hybridization. In contrast to Northern and Southem hybridizations, 
these techniques are usually performed with relatively short probes (e.g., 
usually 16 nucleotides or longer for PGR or sequencing, and 40 nucleotides 

20 or longer for in situ hybridization). The high stringency conditions used in 
these techniques are well known to those skilled in the art of molecular 
biology, and may be found, for example, in Ausubel et al., Current 
Protocols in Molecular Biology^ John Wiley & Sons, New York, NY, 1998, 
hereby incorporated by reference. 

25 By "probe" or 'primer" is meant a single-stranded DNA or RNA 

molecule of defined sequence that can base pair to a second DNA or RNA 
molecule that contains a complementary sequence ("target' *). The stabiUty 
of the resulting hybrid depends upon the extent of the base pairing that 
occurs. This stability is affected by parameters such as the degree of 
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complementarity between the probe and target molec\ile, and the degree of 
stringency of the hybridization conditions. The degree of hybridization 
stringency is affected by parameters such as the temperature, salt 
concentration, and concentration of organic molecules, such as formamide, 
5 and is determined by methods that are well known to those skilled in the 
art. Probes or primers specific for prostate-specific or a testis-specific 
nucleic acid molecules, preferably, have greater than 45% sequence 
identity, more preferably at least 55-75% sequence identity, still more 
preferably at least 75-85% sequence identity, yet more preferably at least 

10 85-99% sequence identity, and most preferably 100% sequence identity to 
the nucleic acid sequences encoding tibie amino acid sequences described 
herein. Probes can be detectably-labeled, either radioactively or non- 
radioactively, by methods that are well-known to those skilled in the art. 
Probes can be used for methods involving nucleic acid hybridization, such 

15 as nucleic acid sequencing, nucleic acid amplification by the polymerase 
chain reaction, single stranded conformational polymorphism (SSCP) 
analysis, restriction fragment polymorphism (RFLP) analysis, Southern 
hybridization, northern hybridization, in situ hybridization, electrophoretic 
mobility shift assay (EMSA), and other methods that are well known to 

20 those skilled in the art. 

A molecule, eg-., an oligonucleotide probe or primer, a gene or 
fragment thereof, a cDNA molecule, a polypeptide, or an antibody, can be 
said to be "detectably-labeled" if it is marked in such a way that its 
presence can be directiy identified in a sample. Methods for detectably- 

25 labeling molecules are well known in the art and include, without 

limitation, radioactive labeling (eg., with an isotope, such as ^^P or ^^S) and 
nonradioactive labeling (eg., with a fluorescent label, such as fluorescein, 
or by generating a construct containing green fluorescent protein (GFP)). 
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By 'transgenic" is meant any cell that includes a DNA sequence or 
transgene that is inserted by artifice into a cell and becomes part of the 
genome of the organism that develops firom that cell. As xxsed hereiu, the 
transgenic organisms are generally transgenic mammals (e.g., mice, rats, 
5 and goats) and the DNA (transgene) is inserted by artifice into the nuclear 
genome. By '*transgene" is meant any piece of DNA that is inserted by 
artifice into a cell, and becomes part of the genome of the organism that 
develops firom that cell. Such a transgene may include a gene that is partly 
or entirely heterologous (i.e., foreign) to the transgenic organism, or may 

10 represent a gene homologous to an endogenous gene of the organism. By 
'Taiockout mutation" is meant an artificially induced alteration in the 
nucleic acid sequence (created via recombinant DNA technology or 
deliberate exposure to a mutagen) that reduces the biological activity of the 
polypeptide normally encoded therefrom by at least 80% relative to the 

15 unmutatied gene. The mutation may, without limitation, be an insertion, 
deletion^ fraineshift mutation, or a missense mutation. The knockout 
mutation can be in a cell ex vivo (e.g., a tissue culture cell or a primary cell) 
or in vzvo. A *Tcnockout animal" is a mammal, preferably, a mouse, 
containing a knockout mutation as defined above. 

20 By "sample" is meant a tissue biopsy, cells, blood, serum, urine, 

stool, or other specimen obtained from a patient or test subject. The sample 
is analyzed to detect a mutation in a gene encoding a prostate-specific or a 
testis-specific polypeptide, or expression levels of a gene encoding a 
prostate-specific or a testis-specific polypeptide, as for example, an 

25 indication of tiie progression of cancer, by methods that are known in the 
art or described herein. For example, methods such as sequencing, single- 
strand conformational polymorphism (SSCP) analysis, or restriction 
fragment length polymorphism (RFLP) analysis of PGR products derived 
from a patient sample may be used to detect a mutation in a gene encoding 
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a prostate-specific or a testis-specific polypeptide; ELIS A may be used to 
measure levels of a prostate-specific or a testis-specific polypeptide; and 
PGR may be used to measure the level of nucleic acids encoding a prostate- 
specific or a testis-specific polypeptide. 
5 By **phaimaceutically acceptable carrier" is meant a carrier that is 

physiologically acceptable to the treated mammal while retaining the 
therapeutic properties of the compound with which it is administered. One 
exemplary pharmaceutically acceptable carrier is physiological saline 
solution. Other physiologically acceptable carriers and their formulations 

10 are known to one skilled in the art and described, for example, in 

Remington: The Science and Practice of Pharmacy, (19* edition), ed. A. 
Gennaro, 1995, Mack Publishing Company, Easton, PA. 

'Therapeutically effective amounf ' as used herein in reference to 
dosage of a medication, refers to the administration of a specific amount of 

15 a pharmacologically active agent (e.g., a prostate-specific or a testis- 
specific polypeptide, nucleic acid molecule, or modulatory compoimd) 
tailored to each individual patient manifesting symptoms characteristic of a 
specific disorder. For example, a patient receiving the treatment of the 
present invention might have prostate cancer. A person skilled ia the art 

20 will recognize that the optimal dose of a pharmaceutical agent to be 
administered will vary firom one iadividual to another. Dosage in 
individual patients should take into account the patients height, weight, rate 
of absorption and metabolism of the medication in question, the stage of the 
disorder to be treated, and what other pharmacological agents are 

25 administered concurrently. 

By 'treating " or "treatment" is meant the medical management of a 
patient with the intent to cure, ameUorate, stabilize, or prevent a disease, 
pathological condition, or disorder. This term includes active treatment, 
that is, treatment directed specifically toward the improvement or 
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associated with the cure of a disease, pathological condition, or disorder, 
and also includes causal treatment, that is, treatment directed toward 
removal of the cause of the associated disease, pathological condition, or 
disorder. In addition, this term includes palliative treatment, that is, 
5 treatment designed for the relief of symptoms rather than the curing of the 
disease, pathological condition, or disorder; preventative treatment, that is, 
treatment directed to minimizing or partially or completely inhibiting the 
development of the associated disease, pathological condition, or disorder; 
and supportive treatment, that is, treatment employed to supplement 
10 another specific therapy directed toward the improvement of the associated 
disease, pathological condition, or disorder. The phrase "treatment" also 
includes symptomatic treatment, that is, treatment directed toward 
constitutional symptoms of the associated disease, pathological condition, 
or disorder. 

1 5 By "disorder of the prostate or testis" is meant a disturbance ,of 

function and/or structure of the prostate or testis in a Uvmg organism, 
resulting from an external source, a genetic predisposition, a physical or 
chemical trauma, or a combination of the above. Such disorders include the 
proliferation of prostate or testicular cells. By "cell proliferation" is meant 

20 the growth or reproduction of similar cells, and the invention provides 
reagents for inhibiting proliferation and stimulating proliferation. By 
"iohibiting proliferation" is meant the decrease in the number of similar 
cells by at least 10%, more preferably by at least 20%, and most preferably 
by at least 50%. By "stimulatiag proliferation" is meant an increase in the 

25 number of similar cells by at least 1 0%, more preferably by at least 20%, 
and most preferably by at least 50%. 

The reagents described herein, for example, vectors expressing 
antisense, antagonists, or inhibitors of prostate-specific or testis-specific 
polypeptides or nucleic acid molecules may be used, for example, to 
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si5)press the excessive proliferation of prostate or testicular cells. Blockiiig 
prostate-specific or testis-specific polypeptide or nucleic acid molecule 
expression or activity in prostate or testicular cells can alter molecular 
pathways within cancerous cells and thus trigger apoptosis, i.e., the process 
5 of cell death where a dying cell displays a set of well-characterized 
biochemical haUmarks which include c3^olemmal blebbing, cell soma 
shrinkage, chromatin condensation, and DNA laddering. 

Disorders of the prostate or testis include prostate cancer, benign 
prostatic hyperplasia, acute prostatitis, testicular cancer, developmental 

10 defects of the prostate or testis (such as cryptorchidism or undescended 
testis, and retractile, ascending, or vanished testis). 

By ^'proliferative disease" is meant a disease that is caused by or 
results in inappropriately high levels of cell division, inappropriately low 
levels of apoptosis, or both. For example, cancers such as prostate cancer, 

1 5 testicular cancer, lymphoma, leukemia,,melanoma, ovarian cancer, breast 
cancer, pancreatic cancer, liver cancer, and lung cancer are all examples of 
proliferative disease. ^ . . ^ . 

By ''modulate" or "modulating"is meant changing, either by 
decrease or increase, the e;q)ression or biological activity of a prostate- 

20 specific or testis-specific nucleic acid molecule or polypeptide, as described 
herein. It will be appreciated that the degree of modulation provided by a 
modulating compoimd in a given assay will vary, but that one skilled in the 
art can determine the statistically significant change in the level of 
biological activity that identifies a compound that modulates a prostate- 

25 specific or testis-specific nucleic acid molecule or polypeptide. 

The invention provides several advantages. For example, it provides 
methods and reagents that can be used in the diagnosis and treatment of 
prostate and testis associated diseases, as well as other disorders and 
conditions that are sensitive to the bioactivities of the reagents (e.g.. 
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polypeptides, nucleic acid molecules, antibodies) described hereia. Since 
the prostate-specific or testis-specific polypeptides of the invention have 
been found to be highly expressed in the prostate and testis, these 
polypeptides can also be used in screens for therapeutics to treat disorders 
5 associated with the prostate and testis. These polypeptides are also 

expressed in other tissues, and can be used as therapeutics and diagnostics 
for cell proliferative disorders. 

Other features and advantages of the invention will be q)parent from 
the detailed description of the invention, the drawings, and the claims. 

10 

Brief Description of The Drawings 
Figure 1 shows an exemplary reverse northern analysis of several 
clones from a prostate specific cDNA library. 

Figure 2 shows an exemplary multiple tissue northern blot 
1 5 Figure 3 is a table showing the nucleotide sequences of twelve 

clones (SEQID NOs: l-12)'isolated from prostate tissue and LNCaP cells . 

Figure 4A is a schematic diagram showing the STMPl gene 
structure. ' 

Figure 4B shows the nucleotide sequence, including the intron 
20 junction sequences (SEQ ID NO: 13), and predicted amino acid sequence 
(SEQ ID NO: 14) of STMPl, 

Figure 4C shows the nucleotide sequences of the exons and 3' UTR 
of STMPl (SEQ ID NOs: 15-21). 

Figure 4D shows the nucleotide sequence of the ORF of STMPl 
25 (SEQ ID NO: 22). 

Figure 4E shows the shows the cDNA sequence (SEQ ID NO: 23), 
and predicted amino acid sequence (SEQ ID N0:14) of STMPl. 

Figure 4F shows the nucleotide sequences of the exons and 3' UTR 
of STMPl 0RF2 (SEQ ID NOs: 17-20 and 24-26). 
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Figure 4G shows the nucleotide sequence of the ORF of STMPl 
0RF2 (SEQIDNO: 27). 

Figure 4H shows the cDNA sequence (SEQ ID NO: 28), and 
predicted amino acid sequence (SEQ ID NO: 29) of STMPl 0RF2. 
5 Figure 41 shows the nucleotide sequences of the exons and 3 ' UTR 

of STMPl 0RF3 (SEQ ID NOs: 17-19 and 24-26). 

Figure 4 J shows the nucleotide sequence of the ORF of STMPl 
0RF3 (SEQIDNO: 30). 

Figure 4K shows the cDNA sequence (SEQ ID NO: 3 1), and 
10 predicted amino acid sequence (SEQ ID NO: 32) of STMPl 0RF3, 

Figure 4L shows the cDNA sequence (SEQ ID NO: 33), and 
predicted amino acid sequence (SEQ ID NO:34) of STMP2. 

Figure 4M shows the cDNA sequence (SEQ ID NO: 35), and 
predicted amino acid sequence (SEQ ID NO: 36) of STMP3. 
15 Figure 5 shows a sequence aliment of STMPl (SEQ ID NO: 14), 

with STEAP (SEQ ID NO: 37, Accession No. AF186249), and two ESTs 
(Accession No. BAA91839 and Accession No. BAB 15559; SEQ ID NOs: 
38 and 39, respectively). 

Figure 6A shows a multiple tissue Northem blot probed with STMPl 
20 otG3PDHcDNA. 

Figure 6B is a Northem blot probed with STMPl and PSA in the 
androgen-responsive prostate cancer cell line LNCaP and in the CWR22 
human prostate cancer xenograft model. 

Figure 6C is a Northem blot probed with STMPl and NKX3A in 
25 LNCaP, PC-3, and DU-145 cell Unes and in the CWR22R human prostate 
cancer xenograft model. 

Figure 7 A shows fluorescence microscopy images of COS- 1 cells 
transiently transfected with GFP-STMPl. 
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Figure 7B shows fluorescence microscopy images of COS- 1 cells 
transiently transfected with GFP-STMPl and labeled with antibodies 
against Golgi markers. 

Figure 8 shows fluorescence microscopy images of COS-1 cells 
5 transiently transfected with GFP-STMP 1 and observed by live-cell confocal 
microscopy. 

Figure 9 shows fluorescence microscopy images of COS-1 cells 
transiently transfected with GFP-STMPl and labeled with an antibody 
against an early endosomal marker. 
10 Figure 10 is a schematic diagram showing the SSH9 gene structure 

and two mRNA species transcribed from the SSH9 gene. 

Figure 1 1 A shows the cDNA (SEQ ID NO: 40) and predicted amino 
acid sequence (SEQ ID NO: 41) for SSH9. 

Figure 1 IB shows the predicted promoter sequence for SSH9 (SEQ 
15. ID NO: 42). 

i ; : Figure 1 IC shows the predicted intron-exon boundaries for SSH9 
1 (SEQ ID NOs: 43-50). 

Figure 12A is a Northem blot probed with SSH9 in the androgen- 
responsive prostate cancer cell line LNCaP cells and in the CWR22 human 
20 prostate cancer xenograft model. 

Figure 12B is a Northem blot probed with SSH9 in LNCaP, PC-3, 
and DU-145 cell lines, and CWR22R human prostate cancer xenograft 
model. 

Figure 12C is a multiple tissue Northem blot probed with SSH9 or 
25 GAPDHcVNA. 

Figure 13 is a schematic diagram showing the PSL22 gene structure. 
Figure 14A shows the nucleotide sequence of the ORF of PSL22 
(SEQ ID NO: 51). 
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Figure 14B shows the cDNA sequence (SEQ ID NO: 52), and 
predicted amino acid sequence (SEQ ID NO: 53) of PSL22. 

Figure 14C shows the nucleotide sequences of the TATA promoter 
and transcription start site, exons, and 5' and 3' UTRs of PSL22 (SEQ ID 
5 NOs: 54-70). 

Figure 15 shows a sequence alignment of PSL22 (RhoBP) (SEQ ID 
NO: 53), with ESTs NP032190 (mRhoph), AF132025 (dRhoph), and 
BAB23615 (SEQ ID Nos:71-73), 

Figure 16A is a Northern blot probed with PSL22 in LNCaP, PC-3, 
10 and DU-145 cell lines, and in the CWR22R human prostate cancer 
xenograft model. 

Figure 1 6B is a multiple tissue Northern blot probed with PSL22 
cDNA. 



15 Detailed Description of the Invention 

The basic biology of the normal prostate and testis, as well as 
prostate and testicular cancer initiation and progression is still poorly 
imderstood. It is therefore necessary to delineate the molecular events that 
are at the basis of these processes. To achieve this goal, we have 

20 identified, cloned, and characterized highly prostate- and testis-enriched 
genes whose gene products have important roles for both the normal 
physiology and the pathophysiology of the prostate and the testis. These 
gene products also have important roles in other disorders, for example, 
heart, brain, liver, pancreas, kidney, and colon, which are the tissues where 

25 variable low expression, and occasionally, very high expression of specific 
gene products, can be detected by Norfhem analysis. 

The invention provides prostate-specific or testis-specific 
polypeptides and nucleic acid molecules (see below), and diagnostic and 
therapeutic methods employing these polypeptides and nucleic acid 
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molecules. The invention also provides methods for identifying 
compounds that modulate the biological activities of prostate-specific or 
testis-specific polypeptides and nucleic acid molecules, and therapeutic 
methods employing these compounds. The diagnostic, therapeutic, and 
5 screening methods of the invention are first described, followed by general 
approaches that can be used in carrying out these methods. Finally, 
experimental resTxlts supporting the methods of the invention are described. 

Bioassays 

Prostate-specific and testis-specific polypeptides are expressed in the 

10 prostate and testis, and also in other tissues such as kidney, pancreas, liver, 
lung, and colon. The expression patterns of prostate-specific and testis- 
specific polypeptides in specific cells and tissues are used to identify 
cellular targets of prostate-specific and testis-specific polypeptide actions, 
and to identify bioactivities that are relevant to specific prostate- and testis- 

1 5 related diseases, such as prostate cancer, testicular cancer, benign prostatic 
hyperplasia, acute prostatitis, and developmental testis defects. 

Therapeutic and diagnostic utilities for prostate-specific and testis- 
specific polypeptides are identified by, for example, conducting bioassays 
in vitro. Culture systems that reflect prostate-specific and testis-specific 

20 polypeptide expression patterns, along with the distribution of particular 
receptors, such as the androgen receptor, are selected. For example, 
LNCaP cells express androgen receptors, and respond to one or more 
isoforms of prostate-specific and testis-specific polypeptides in a variety of 
bioassays. The activities of prostate-specific and testis-specific 

25 polypeptides (e.g., STMPl, SSH9, PSL22) are compared, using sister 
cultures, in various dose-response assays, including but not limited to, 
inhibition of proliferation, apoptosis, signaling events (e.g. changes in 
kinase activity), changes in transcription factor activity (such as that of the 
androgen receptor), intracellular trafficking, or cell signaling. The relative 
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potencies of the prostate-specific and testis-specific polypeptides are 
determined on the basis of, for example, protein concentration. 

Diagnostic Methods Employing Prostate-Specific Or Testis-Specific 
5 Nucleic Acid Molecules, Polypeptides, and Antibodies 

Prostate-specific or testis-specific nucleic acid molecules, 
polypeptides, and antibodies are used in methods to diagnose or monitor a 
variety of diseases and conditions, including those involving mutations in, 
or inappropriate expression of, prostate-specific or testis-specific genes. 
10 Prostate-specific or testis-specific expression has been docimiented in a 
variety of tissues, as discussed above. Thus, detection of abnormalities in 
prostate-specific or testis-specific genes or their expression is used in 
methods to diagnose, or to monitor treatment or development of diseases of 
these tissues. 

15 The diagnostic methods of the invention are used, for example, with 

patients that have a prostate-related or testis-related disease, for example, 
prostate or testicular cancer, in an effort to determine its etiology, and thus, 
to facilitate selection of an appropriate course of treatment The diagnostic 
methods are also used with patients that have not yet developed a prostate- 

20 related or testis-related disease, but who may be at risk of developing such 
a disease, or with patients that are at an early stage of developing such a 
disease. Many prostate-related or testis-related diseases occur during 
development, and thus, the diagnostic methods of the invention are also 
carried out on a fetus or embryo during development. Also, the diagnostic 

25 methods of the invention are used in prenatal genetic screening, for 

example, to identify parents who may be carriers of a recessive prostate- 
related or testis-related mutation. 

Prostate-specific or testis-specific abnormalities that are detected 
using the diagnostic methods of the invention include those characterized 
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by, for example, (i) abnoimal prostate-specific or testis-specific 
polypeptides, (ii) prostate-specific or testis-specific genes containing 
mutations that result in the production of such polypeptides, and (iii) 
mutations that result in production of abnormal amounts of prostate- 
5 specific or testis-specific polypeptides. 

Levels of prostate-specific or testis-specific expression in a patient 
sample are determined by using any of a number of standard techniques 
that are well known in the art. For example, prostate-specific or testis- 
specific expression in a biological sample (e.g., a blood, prostate or testis 

10 tissue sample, or amniotic fluid) from a patient is monitored by standard 
northem blot analysis or by quantitative PGR (see, eg., Ausubel et al. 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 
NY, 1998; PCR Technology: Principles and Applications for DNA 
Amplification^ H.A. Ehrlich, Ed., Stockton Press, NY; Yap et aL NucL 

15 Acids. Res A9A294, 1991): 

A biological sample obtained from a patient can be analyzed for one 
or more mutations in prostate-specific or testis-specific nucleic acid 
molecules using a ncdsmatch detection approach. Generally, this approach 
involves PCR amplification of nucleic acid molecules from a patient 

20 sample, followed by identification of a mutation a mismatch) by 
detection of altered hybridization, aberrant electrophoretic gel migration, 
binding, or cleavage mediated by mismatch binding proteins, or by direct 
nucleic acid molecule sequencing. Any of these techniques can be used to 
faciUtate detection of mutant prostate-specific or testis-specific genes, and 

25 each is well known in the art Examples of these techniques are described, 
for example, by Orita et al (Proc. Natl. Acad. Set USA Seinee-mO, 
1989) and Sheffield et al. (Proa Natl. Acad Sci. USA 86:232-236, 1989). 

Mismatch detection assays also provide an opportunity to diagnose a 
prostate-specific or testis-specific gene-mediated predisposition to a disease 
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before the onset of symptoms. For example, a patient heterozygous for a 
prostate-specific or testis-specific mutation that suppresses normal prostate- 
specific or testis-specific biological activity or expression may show no 
clinical symptoms of a prostate-specific or testis-specific gene-related 
5 disease, and yet possess a higher than normal probability of developing a 
prostate or testiciilar disease. Given such a diagnosis, patients can take 
precautions to nadboinaize their exposure their exposure to adverse 
environmental factors and to carefully monitor their medical condition (for 
example, through frequent physical examinations). As mentioned above, 

10 this type of diagnostic approach can also be used to detect prostate-specific 
or testis-specific mutations in prenatal screens. 

The prostate-specific or testis-specific diagnostic assays described 
above can be carried out using any biological sample (for example, a blood, 
prostate, or testis tissue sample, or amniotic fluid) in which aprostate- 

15 specific or testis-specific polypeptide or nucleic acid molecule is normally 
expressed. A mutant prostate-specific or testis-specific gene can also be 
identified using these sources as test samples. Altematively, a prostate- 
specific or testis-specific mutation, as part of a diagnosis for predisposition 
to a prostate-specific or testis-specific gene-associated disease, can be 

20 tested for using a DNA sample from any cell, for example, by mismatch 
detection techniques. Preferably, the DNA sample is subjected to PGR 
amplification prior to analysis. 

In yet another diagnostic approach of the invention, an immunoassay 
is used to detect or monitor prostate-specific or testis-specific protein 

25 expression ia a biological sample. Anti-prostate-specific or testis-specific- 
polypeptide polyclonal or monoclonal antibodies (as described below) can 
be used in any standard immunoassay format (e.g., ELISA, Westem blot, or 
RIA; see, e.g., Ausubel et aly supra) to measure prostate-specific or testis- 
specific polypeptide levels. These levels are compared to wild-type 
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prostate-specific or testis-specific levels. For example, an increase in 
prostate-specific or testis-specific polypeptide production may be indicative 
of a condition or a predisposition to a condition involving overexpression 
of prostate-specific or testis-specific polypeptide biological activity, such as 
5 late stage prostate cancer. 

Immunohistocliemical techniques can also be utilized for prostate- 
specific or testis-specific polypeptide detection. For example, a tissue 
sample can be obtained from a patient, sectioned, and stained for the 
presence of prostate-specific or testis-specific polypeptide usiag an anti- 

10 prostate-specific or testis-specific antibody (see below) and any standard 
detection system one that includes a secondary antibody conjugated to 
horseradish peroxidase). General guidance regarding such techniques can 
be found in, eg-., Bancroft et al, Theory and Practice of Histological 
Techniques, Churchill Livingstone, 1982, and Ausubel et al, supra. 

15 hi a preferred example, a combiaed diagnostic method can be 

employed that iacludes an evaluation of prostate-specific or testis-specific 
protein production (for example, by immunological techniques or the 
protein truncation test (Hogerrorst et aL, Nature Genetics 10:208-212, 
1995), and a nucleic acid molecule-based detection technique designed to 

20 identify more subtle prostate-specific or testis-specific mutations (for 
example, point mutations). As described above, a number of mismatch 
detection assays are available to those skilled in the art, and any preferred 
technique can be used. Mutations in prostate-specific or testis-specific 
genes can be detected that either result m loss or gain of prostate-specific or 

25 testis-specific polypeptide or nucleic acid molecule expression or loss or 
gain of normal prostate-specific or testis-specific polypeptide or nucleic 
acid molecule biological activity. 

Prostate-specific or testis-specific polypeptides or nucleic acid 
molecules can be used to conelate the course of prostate cancer to a marker 
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other than PSA, to monitor the coxirse of an anticancer therapy, or to detect 
a neoplastic cell in a system. For example, a predetermined quantity of an 
RNA encoding a prostate-specific or testis-specific polypeptide is 
correlated with the presence of a neoplastic cell, for example, firom a 
5 biopsy. The total RNA is extracted from the biopsy specimen, and a real 
time quantitative rt-PCR employing individual reactions with primer pairs 
specific to prostate-specific or testis-specific sequences is performed in 
parallel with a biopsy specimen known to be free of cancer cells. Biopsy 
specimens are deteraiined to have a cancer cell, where the detected 
10 prostate-specific or testis-specific niElNA quantity is at least 5 times higher 
than in the control specimen. An exemplary extraction of total RNA 
utilizes the Quiagen BioRobot kit in conjunction with the BioRobot 9600 
system, and the real time rfPCR is performed in a Perkin Ehner ABI Prism 
7700. 

1 5 ^ ' In altemative aspects of the inventive subject matter, the method of 
: detecting a neoplastic cell need not be limited to biopsy tissues from 
prostate or testis tissue, but may employ various altemative tissues, 
including lymphoma tumor cells, and various solid tumor cells, so long as 
such tumor cells overproduce mRNA of prostate-specific or testis-specific 

20 polypeptides. Appropriate altemative tumor cells can readily be identified 
by the above described method. Likewise, the system need not be restricted 
to a mammal, but may also iaclude cell-, and tissue cultures grown in vitro, 
and tumor cells and specimens from animals other than mammals. For 
example, tumor cell and tissue grown in vitro may advantageoxisly be 

25 utilized to investigate drug action on such cells, and sequences encoding 
prostate-specific or testis-specific polypeptides may convenientiy be 
employed as tumor marker. Altematively, body fluids serum, saliva, 
etc.) that may or may not contain tumor cells are also contemplated a 
suitable substrate for the method presented herein, so long as they contain 
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to at least some extent inRNA encoding a prostate-specific or testis-specific 
polypeptide. 

In still other aspects of contemplated methods, the polypeptide 
quantity need not necessarily be limited to at least 5 times more than the 
5 control specimen in order to establish that the tissue has a cancer cell. For 
example, where the concentration of the polypeptide is hormone dependent, 
amoimts between 3-8 fold and more may be appropriate. In contrast, where 
the concentration of cancer cells in the biopsy specimen is relatively low, 
amoimts of less than 5-fold, including 1.5 to 4.9-fold and less are 

10 contemplated. 

The detection process may include fluorescence detection, 
lunMnescence detection, scintigraphy, autoradiography, and formation of a 
dye. For example, for microscopic analysis of biopsy specimens, luciferase 
labeled probes are particularly advantageous in conjimction with a 

15 luminescence substrate luciferin). Linninescence quantification may 
then be performed utilizing a CCD-camera and image analysis system. 
Similarly, radioactivity may be detected via autoradiographic or 
scintigraphic procedures on a tissue section, in a fluid or on a soUd support. 
Where the probe is a natural or synthetic hgand of a prostate-specific or 

20 testis-specific polypeptide, the ligand may include molecules with a chemi- 
cal modification that increase the affinity to the polypeptide and/or induce 
irreversible binding to the polypeptide. For example, transition state 
analogs or suicide inhibitors for a particular reaction catalyzed by the 
polypeptide are especially contemplated. Labeling of antibodies, antibody 

25 fragments, small molecules, and binding of the labeled entity is a technique 
that is well known in the art, and all known methods are generally suitable 
for use in conjunction with methods contemplated herein. Furthermore, the 
probe need not be limited to a fluorescein labeled antibody, and alternative 
probes include antibody fragments Fab, Fab*, scFab, etc.). 



30 



wo 01/72962 



PCT/USOl/09410 



Still further contemplated variations include substitution of one or 
more atoms or chemical groups in the sequence with a radioactive atom or 
group. For example, where cDNAs are employed as a hybridization- 
specific probes, a fluorophor or eirzyme {e,g., P-galactosidase for 
5 generation of a dye, or luctferase for generation of luminescence) may be 
coupled to the sequence to identify position and/or quantity of a comple- 
mentary sequence. Alternatively, where contemplated cDNA molecules are 
utilized for affinity isolation procedures, the cDNA may be coxipled to a 
molecule that is known to have a high-affixiity (z.e., K<i<10"^mor^) partner, 
10 such as biotin, or an oligo-histidyl tag. In another example, one or more 
phosphate groups may be exchanged for a radioactive phosphate group with 
a ^^P or ^^P isotope to assist in detection and quantification, where the 
radiolabeled cDNA is employed as a hybridization probe. 

15 Therapeutic Methods Employing Prostate-Specific Or Testis-Specific 
Nucleic Acid Molecules, Polypeptides, and Antibodies 

The invention includes methods of treating or preventing prostate- 
specific or testis-specific diseases. Therapies are designed to circmnvent or 
overcome a prostate-specific or testis-specific gene defect, or inadequate or 

20 excessive prostate-specific or testis-specific gene expression, and thus 
modulate and possibly alleviate conditions iavolviug defects in prostate- 
specific or testis-specific genes or proteins. In considering various 
therapies, it is imderstood that such therapies are, preferably, targeted to the 
affected or potentially affected organs, for example, the prostate or the 

25 testis. Reagents that are used to modulate prostate-specific or testis- 
specific biological activity can include, without limitation, full length 
prostate-specific or testis-specific polypeptides; prostate-specific or testis- 
specific cDNA, niRNA, or antisense RNA; prostate-specific or testis- 
specific antibodies; and any compound that modxilates prostate-specific or 
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testis-specific polypeptide or nucleic acid molecule biological activity, 
expression, or stability. 

Treatment or prevention of diseases resulting from a mutated 
prostate-specific or testis-specific gene is accomplished, for example, by 
5 replacing a mutant prostate-specific or testis-specific gene with a normal 
prostate-specific or testis-specific gene, administering a normal prostate- 
specific or testis-specific gene, modulating the fimction of a mutant 
prostate-specific or testis-specific protein, delivering normal prostate- 
specific or testis-specific protein to the appropriate cells, or alteriag the 
10 levels of normal or mutant prostate-specific or testis-specific protein. It is 
also possible to correct a prostate-specific or testis-specific gene defect to 
modify the physiological pathway an intracellular trafficking 
pathway) in which the prostate-specific or testis-specific protein 
participates. 

. : 1 5 To replace a mutant protein with normal protein, or to add protein to, 

cells that do not express sufficient or normal prostate-specific or testis- 
specific protein, it may be necessary to obtain large amounts of pure 
prostate-specific or testis-specific protein from cultured cell systems in 
which the protein is expressed (see, eg., below). Delivery of the protein to 

20 the affected tissue can then be accomplished using appropriate packaging or 
administrating systems. Alternatively, small molecule analogs that act as 
prostate-specific or testis-specific molecule agonists or antagonists can be 
administered to produce a desired physiological effect (see below). 

Gene therapy is another therapeutic q)proach for preventing or 

25 ameUorating diseases caused by prostate-specific or testis-specific gene 
defects. Nucleic acid molecules encoding wild type prostate-specific or 
testis-specific proteins can be delivered to cells that lack sufficient, normal 
prostate-specific or testis-specific biological activity (eg., cells carrying 
mutations in prostate-specific or testis-specific genes). The nucleic acid 
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molecules must be delivered to those cells in a fonn in whicli they can be 
taken up by the cells and so that sufficient levels of protein, to provide 
effective prostate-specific or testis-specijSc function, can be produced. 
Alternatively, for some prostate-specific or testis-specific mutations, it may 
5 be possible slow the progression of the resulting disease or to modulate 
prostate-specific or testis-speciSc activity by introducing another copy of a 
homologous gene bearing a second mutation in that gene, to alter the 
mutation, or to use another gene to block any negative efifect 

Transducing retroviral, adenoviral, and adeno-associated viral 

10 vectors can be used for somatic cell gene therapy, especially because of ' 
their high efficiency of infection and stable integration and expression (see, 
eg., Cayouette et al. Human Gene Therapy 8:423-430, 1997; Kido et al, 
• Current Eye Research 15:833-844, \996\B\oomsr et al. Journal of 
Virology 71:6641-6649, 1997; Naldioi et al. Science 272:263-267, 1996; 

15 and Miyoshi et alyProc. Natl Acad, Scl, USA 94:10319-1032, 1997). For 
example, the full length prostate-specific or testis-specific gene, or a 
portion thereof, can be cloned into a retroviral vector and expression can be 
driven from its endogenous promoter, from the retroviral long terminal 
repeat, or from a promoter specific for a target cell type of interest (such as 

20 aortic or other vascular cells). Other viral vectors that can be used include, 
for example, vaccinia virus, bovine papilloma virus, or a herpes virus, such 
as Epstein-Barr Virus (also see, for example, the vectors of Miller, Human 
Gene Therapy 15-14, 1990; Friedman, i'de/ice 244:1275-1281, 1989; 
EgUtis etaly BioTechniques 6:608-614, 1988; Tolstoshev et al, Current 

25 Opinion in Biotechnology 1:55-61, 1990; Sharp, The Lancet 337:1277- 
1278, 1991; Cometta et al. Nucleic Acid Research and Molecular Biology 
36:311-322, 1987; Anderson, Science 226:401-409, 1984; Moen, Blood 
Cells 17:407-416, 1991; or Miller et al. Biotechnology 7:980-990, 1989). 
Retroviral vectors are particularly well developed and have been used in 
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clinical settings (Rosenberg ei al, N. Engl. J. Med 323:370, 1990; 
Anderson et al, U.S. Patent No. 5,399,346). 

Gene transfer can also be achieved using non-viral means involving 
transfection in vitro ^ by means of any standard technique, including but not 
5 limited to, calcium phosphate, DEAE dextran, electroporation, protoplast 
fusion, and liposomes. Transplantation of normal genes into the affected 
tissues of a patient can also be accomplished by transferring a normal 
prostate-specific or testis-specific gene into a cultivatable cell type ex vivo, 
after which the cell (or its descendants) is injected into a targeted tissue. 

10 Another strategy for inhibiting prostate-specific or testis-specific function 
using gene ther^y involves intracellular expression of an anti-prostate- 
specific or testis-specific antibody or a portion of an prostate-specific or 
testis-specific antibody. For example, the gene (or gene fragment) 
encoding a monoclonal antibody that specifically binds to prostate-specific 
r ■ . 15 . or testis-specific polypeptide and inhibits its biological activity is placed 
under the transcriptional control of a tissue-specific gene regulatory 
sequence. Another ther^eutic approach involves administration of 
recombinant prostate-specific or testis-specific polypeptide, either directly 
to the site of a potential or actual disease-affected tissue (for example, by 

20 injection) or systemically (for example, by any conventional recombinant 
protein administration technique). The dosage of a prostate-specific or 
testis-specific polypeptide depends on a number of factors, including the 
size and health of the individual patient but, generally, between about 0.006 
mg/kg to about 0.6 mg/kg, inclusive, is administered per day to an adult in 

25 any pharmaceutically acceptable formulation. 

Non-viral approaches can also be employed for the introduction of 
therapeutic DNA into cells predicted to be subject to diseases involving a 
prostate-specific or testis-specific disorder. For example, a prostate- 
specific or testis-specific nucleic acid molecule or an antisense nucleic acid 
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molecule can be introduced into a cell by lipofection (Feigner et al. Proa 
Natl Acad. Sci. USA 84:7413, 1987; Ono et al, Neuroscience Letters 
17:259, 1990; Bn^ametal.Am. J. Med Set 298:278, 1989; Staubingere^ 
al. Methods inEnzymology 101:512, 1983), asialoorosomucoid-polylysine 
5 conjugation (Wu et al. Journal of Biological Chemistry 263:14621, 1988; 
Wu et al. Journal of Biological Chemistry 264:16985, 1989), or, less 
preferably, micro-injection under surgical conditions (Wolff et al. Science 
247:1465, 1990). 

Prostate-specific or testis-specific cDNA expression for use in gene 

10 therapy methods can be directed jfrom any suitable promoter (e.g., the 
human cytomegalovirus (CMV), simian virus 40 (SV40), or 
metallothionein promoters), and regulated by any appropriate mammalian 
regulatory element. For example, if desired, enhancers known to 
preferentially direct gene expression in specific cell types can be used to 

15 direct prostate-specific or testis-specific expression. The enhancers used 
can include, without limitation, those that are charaicterized as tissue- or 
cell-specific enhancers. Alternatively, if a prostate-specific or testis- 
specific genomic clone is used as a therapeutic construct (such clones can 
be identified by hybridization with prostate-specific or testis-specific 

20 cDNA, described above), regulation can be mediated by the cognate 

regulatory sequences, or, if desired, by regulatory sequences derived from a 
heterologous source, including any of the promoters or regulatory elements 
described above. 

Antisense-based strategies can be employed to explore prostate- 

25 specific or testis-specific gene fimction and as a basis for therapeutic drug 
design. These strategies are based on the principle that sequence-specific 
suppression of gene expression (via transcription or translation) can be 
achieved by intracellular hybridization between genomic DNA or mRNA 
and a complementary antisense species. The formation of a hybrid RNA 
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duplex interferes with transcription of the target prostate-specific or testis- 
specific-encoding genomic DNA molecule, or processing, transport, 
translation, or stability of the target prostate-specific or testis- specific 
mRNA molecule. 

5 Antisense strategies can be delivered by a variety of approaches. 

For example, antisense oligonucleotides or antisense RNA can be directly 
administered (eg., by intravenous injection) to a subject ia a form tiiat 
allows uptake into cells. Alternatively, viral or plasmid vectors that encode 
antisense RNA (or antisense RNA fi-agments) can be introduced into a cell 
10 in vivo or ex vivo. Antisense effects can be iaduced by control (sense) 
sequences; however, the extent of phenotypic changes are highly variable. 
Phenotypic effects induced by antisense effects are based on changes in 
criteria such as protein levels, protein activity measurement, and target 
mRNA levels. 

1 5 For example/ prostate-specific or testis-specific gene therapy can 

also be accomplished by direct administration of antisense prostate-specific 
or testis-specific mORNA to a cell that is expected to be adversely affected 
by the expression of wild-type or mutant prostate-specific or testis-specific 
polypeptides. The antisense prostate-specific or testis-specific mRNA can 

20 be produced and isolated by any standard technique, but is most readily 
produced by in vitro transcription using an antisense prostate-specific or 
testis-specific cDNA under the control of a high efficiency promoter (eg,, 
the T7 promoter). Administration of antisense prostate-specific or testis- 
specific mRNA to cells can be carried out by any of the methods for direct 

25 nucleic acid molecule administration described above. 

An alternative strategy for inhibiting prostate-specific or testis- 
specific fimction using gene therapy involves intracellular expression of an 
anti-prostate-specific or testis-specific antibody or a portion of an anti- 
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prostate-specific or testis-specific antibody. For example, the gene (or gene 
firagment) encoding a monoclonal antibody tiiat specifically binds to 
prostate-specific or testis-specific and inhibits its biological activity can be 
placed xmder the transcriptional control of a tissue-specific gene regulatory 
5 sequence. 

Another therapeutic approach within the invention involves 
administration of recombinant prostate-specific or testis-specific 
polypeptide, either directly to the site of a potential or actual disease- 
affected tissue (for example, by injection) or systemically (for example, by 

10 any conventional recombinant protein administration technique). The 
dosage of prostate-specific or testis-specific depends on a number of 
factors, including the size and health of the individual patient, but, 
generally, between 0.1 mg and 100 mg, inclusive, are administered per day 
to an adult in any pharmaceutically acceptable formulation. 

15 In a patient diagnosed as having a prostate-specific or testis-specific 

mutation gene or a prostate-specific or testis-specific disease, or as. 
susceptible to prostate-specific or testis-specific gene mutations, aberrant 
prostate-specific or testis-specific polypeptide or nucleic acid molecule 
expression (even if those mutations or expression patterns do not yet result 

20 in alterations in prostate-specific or testis-specific expression or biological 
activity), or to a prostate-specific or testis-specific disease, any of the 
above-described therapies are administered before the occurrence of the 
disease phenotype. Also, compounds shown to modulate prostate-specific 
or testis-specific polypeptide or nucleic acid molecule expression or 

25 prostate-specific or testis-specific polypeptide or nucleic acid molecule 
biological activity are administered to patients diagnosed with potential or 
actual diseases by any standard dosage and route of administration. 
Alternatively, gene ther^y using an antisense prostate-specific or testis- 
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specific mRNA expression construct is undertaken to reverse or prevent the 
gene defect prior to the development of the full course of the disease. 

The therapeutic methods of the invention are, in some cases, targeted 
to prenatal treatment. For example, a fetus found to have a prostate- 
5 specific or testis-specific mutation is administered a gene therapy vector 
including a normal prostate-specific or testis-specific gene, or administered 
a normal prostate-specific or testis-specific protein. Such treatment may be 
required only for a short period of time, or may, in some form, be required 
throughout such a patient's lifetime. Any continued need for treatment, 

10 however, is determined using, for example, the diagnostic methods 
described above. Also as discussed above, prostate-specific or testis- 
specific polypeptide or nucleic acid molecule abnormalities may be 
associated with diseases in adults, and thus, adults are subject to the 
therapeutic methods of the invention as well. 

1 5 Additionally, prostate-specific or testis-specific polypeptides may be 

used to stimulate an immune system to, assist in generating innmunity 
against, for example, prostate cancer cells. 

The methods of the present invention can be used to diagnose or 
treat the disorders described herein in any mammal, for example, humans, 

20 domestic pets, or livestock. Where a non-human mammal is treated or 
diagnosed, the prostate-specific or testis-specific polypeptide, nucleic acid 
molecule, or antibody employed is preferably specific for that species. 
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IdentijScation of Molecules that Modulate Prostate-Specific Or Testis- 
Specific Polypeptide or Nucleic Acid Molecule Biological Activity or 
Whose Biological Activity is ModiJated by Prostate-Specific Or Testis- 
Specific Polypeptides or Nucleic Acid Molecules 
5 Isolation of prostate-specific or testis-specific cDNAs (as described 

herein) also facilitates the identification of molecules that increase or 
decrease prostate-specific or testis-specific polypeptide or nucleic acid 
molecule biological activity. Similarly, molecules whose activity is 
modulated by prostate-specific or testis-specific polypeptide or nucleic acid 

10 . molecule biological activity can be identified. According to one approach, 
candidate molecules are added at varying concentrations to the culture 
medium of cells expressing prostate-specific or testis-specific mRNA. 
Prostate-specific or testis-specific biological activity is then measured using 
standard techniques. The measurement of biological activity can include, 

15 without limitation, the measurement of prostate-specific or testis-specific : 
protein and nucleic acid molecule e3q)ression levels, response to androgens, 
or intracellular localization and trafficking. 

If desired, the effect of candidate modulators on expression can also 
be measured at the level of prostate-specific or testis-specific protein 

20 production using the same general approach and standard immunological 
detection techniques, such as western blotting or immunoprecipitation with 
a prostate-specific or testis-specific-specific antibody (see below). 

A test compound that is screened in the methods described above 
can be a chemical, be it naturally-occurring or artificially-derived. Such 

25 compounds can include, for example, polypeptides, synthesized organic 
molecules, naiturally occurring organic molecules, nucleic acid molecules, 
and components thereof. Candidate prostate-specific or testis-specific 
modulators include peptide as well as non-peptide molecules (e.g., peptide 
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or non-peptide molecules found, e-g-., in a cell extract, mammalian serum, 
or growth medium in which mammalian cells have been cultured). 

Administration of Prostate-Specific Or Testis-Specific Polypeptides^ 
5 Prostate-Specific Or Testis-Specific Nucleic Acid Molecules, and 

Modulators of Prostate-Specific Or Testis-Specific Polypeptide or Nucleic 
Acid Molecule Synthesis or Function 

A prostate-specific or testis-specific protein, nucleic acid molecule, 
or modulator is administered within a pharmaceutically-acceptable diluent, 

10 carrier, or excipient, in unit dosage form to patients or experimental 
animals. Also, conventional pharmaceutical practice is employed to 
provide suitable formulations or compositions in which to administer 
neutralizing prostate-specific or testis-specific antibodies or prostate- 
specific or testis-specific-inhibiting compounds (e,g,, a prostate-specific or 

1 5 testis-specific antisense molecule or a prostate-specific or testis-specific 
dominant negative mutant) to patients suffering from a prostate-specific or 
testis-specific disease, such as prostate cancer, testicular cancer, benign 
hyperplasia of the prostate, or developmental defects of the prostate or 
testis. Administration can begin before or after the patient is symptomatic. 

20 Any appropriate route of administration can be employed, for 

example, administration can be parenteral, intravenous, intra-arterial, 
subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, 
intraventricular, intracapsular, intraspinal, intracistemal, intraperitoneal, 
intranasal, inhalation to deep limg, aerosol, by suppositories, oral, or topical 

25 {e.g. by applying an adhesive patch carrying a formulation capable of 
crossing the dermis and entering the bloodstream). Preferably, the 
administration is local to the afflicted tissue, such as prostate or testis 
tissue. Therapeutic formulations can be in the form of liquid solutions or 
suspensions; for oral administration, formulations can be in the form of 
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tablets or capsules; and for intranasal formulations, in the form of powders, 

nasal drops, or aerosols. Any of the above formulations may be a 

sustained-release formulation. 

Methods that are well known in the art for making formulations are 
5 found, for example, in Remington 's Pharmaceutical Sciences, (1 8* 

edition), ed. A. Gennaro, 1990, Mack Publishing Company, Easton, PA. 

Formulations for parenteral administration can, for example, contain 

excipients; sterile water; or saline; polyalkylene glycols, such as 

polyethylene glycol; oils of vegetable origin; or hydrogenated napthalenes. 
10 Sustained-release, biocompatible, biodegradable lactide polymer, 

lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene 

copolymers can be used to control the release of the compoxmds. Other 

potentially useful parenteral delivery systems for prostate-specific or testis- 

specific modulatory compounds include ethylene-vinyl acetate copolymer 
15 particles, osmotic pimips, implantable infusion systexns, and . H^s^f^:^ 

Formulations for inhalation can contain excipients, for example, lactose, or ^K^}- r> i r 

can be aqueous solutions containing, for example, polyoxyefhylene-9-lauryl • t 

ether, glycocholate, and deoxycholate, or can be oily solutions for 

administration in the form of nasal drops, or as a gel. 

20 

Prostate-Specific Or Testis-Specific Fragments 

Polypeptide fragments that include various portions of prostate- 
specific or testis-specific proteins are useful in identifying the domains 
important for their biological activities, such as protein-protein interactions 
25 and transcription. Methods for generating such fragments are well known 
in the art (see, for example, Ausubel et al, supra), using the nucleotide 
sequences provided herein. For example, a prostate-specific or testis- 
specific protein fragment can be generated by PGR amplifying a desired 
prostate-specific or testis-specific nucleic acid molecule fragment using 
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oligonucleotide primers designed based upon the prostate-specific or testis- 
specific nucleic acid sequences. Preferably, the oligonucleotide primers 
include unique restriction enzyme sites that facilitate insertion of the 
amplified firagment into the cloning site of an expression vector (eg., a 
5 mammalian expression vecoor, see above). This vector can then be 

introduced into a cell (eg., a mammalian cell; see above) by artifice, using 
any of the various techniques known in the art such as those described 
herein, resxilting in the production of a prostate-specific or testis-specific 
polypeptide firagment in the cell containing the expression vector, 

1 0 Prostate-specific or testis-specific polypeptide firagments (e.g. , 

chimeric fiision proteins) can also be used to raise antibodies specific for 
various regions of prostate-specific or testis-specific polypeptides. 
Preferred prostate-specific or testis-specific fiagments include, without 
limitation, fragments including the N-terminal domain of STMPl (amino 

15 acids 1-200), the P5CR domain, and firagments thereof 

Svnthesis of Prostate-Specific Or Testis-Specific Proteins, Polvpeptides, 
and Polvpeptide Fragments 

Those skilled in the art of molecular biology will understand that a 

20 wide variety of expression systems can be used to produce recombinant 

prostate-specific or testis-specific proteins. The precise host cell used is not 
critical to the invention. The prostate-specific or testis-specific proteins can 
be produced in a prokaryotic host (eg., E, coli) or in a eukaryotic host (eg., 
5. cerevisiae, insect cells such as Sf9 cells, or mammalian cells such as 

25 COS, NIH 3T3, CHO, or HeLa cells). These cells are commercially 
available from, for example, the American Type Culture Collection, 
Rockville, MD (see also Ausubel et al. Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, NY, 1998). The method of 
transformation and the choice of expression vehicle (eg., e>q)ression 
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vector) will depend on the host system selected. Transformation and 
transfection methods are described, in Ausubel et al. Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1998, 
and e5q)ression vehicles can be chosen from those provided, in Pouwels 
5 et aly Cloning Vectors: A Laboratory Manual, 1985, Supp. 1987). 

The characteristics of prostate-specific or testis-specific nucleic acid 
molecules are analyzed by introducing such genes into various cell types or 
using in vitro extracellular systems. The function of prostate-specific or 
testis-specific proteins produced in such cells or systems are then examined 

10 under different physiological conditions. Also, cell lines can be produced 
that over-express the prostate-specific or testis-specific gene product, 
allowing purification of prostate-specific or testis-specific proteins for 
biochemical characterization, large-scale production, antibody production, 
and patient therapy. 

1 5 The polypeptides of the invention may be produced in vivo or in 

vitro, and may be chemically and/or enzymatically modified. The 
polypeptides can be isolated from prostate tissue or prostate cancer cells 
that may or may not be in a hormone dependent state. Alternatively, and 
especially where larger amounts (z.e., >lQmg) are desirable, recombinant 

20 production (e.g,, in a bacterial, yeast, insect cell, or mammalian cell system) 
may advantageously be employed to generate significant quantities of 
prostate-specific or testis-specific polypeptides. 

Furthermore, recombinant production not only offers a more eco- 
nomical strategy to produce the polypeptides of the invention, but also 

25 allows specific modification in the amino acid sequence and composition to 
tailor particular biochemical, catalytic and physical properties. For 
example, where increased solubility of is desirable, one or more 
hydrophobic amino acids may be replaced with hydrophilic amino acids. 
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Alternatively, where reduced or increased catalytic activity is required, one 
or more amino acids may be replaced or eliminated. 

In still another example, the polypeptides of the invention can be 
synthesized as fusion proteins including, for example, fusions with en- 
5 zymatically active partners (e.g. , for dye formation or substrate conversion) 
and fluorescent partners such as GFP, EGFB, BFP, etc. 

With respect to chemical and enzymatic modifications of 
contemplated polypeptides, it is many modifications are ^propriate, 
including addition of mono-, and bifunctional linkers, coupling with 

10 protein- and non-protein macromolecules, and glycosylation. For example, 
mono- and bifunctional linkers are especially advantageous where poly- 
peptides are immobilized to a solid support, or covalently coupled to a 
molecule that enhances immunogenicity of contemplated polypeptides 
(e.g,, KLH, or BSA conjugation). Alternatively, the polypeptides may be 

15 coupled to antibodies or antibody fragments to allow rapid retrieval of the - 
polypeptide Jfrom a mixture of molecules. Further couplings include 
covalent and non-covalent coupling of polypeptides with molecules that 
prolong the serum half-life and/or reduce immunogenicity such as 
cyclodextranes and polyethylene glycols. 

20 

Use of Prostate-Specific Or Testis-Specific Antibodies 

Antibodies to prostate-specific or testis-specific proteins are used to 
detect prostate-specific or testis-specific proteins or to inhibit the biological 
activities of prostate-specific or testis-specific proteins. For example, a 
25 nucleic acid molecule encoding an antibody or portion of an antibody can 
be expressed within a cell to inhibit prostate-specific or testis-specific 
function. In addition, the antibodies can be coupled to compounds, such as 
radionuclides and liposomes for diagnostic or therapeutic uses. Antibodies 
that inhibit the activity of a prostate-specific or testis-specific polypeptide 
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can also be useful in preventing or slowing the development of a disease 
caused by inappropriate expression of a wild type or mutant prostate- 
specific or testis-specific gene. For example, the antibodies of the 
invention may be utilized to localize and locally quantify disease-specific 
5 markers in prostate or testis tissue sections, e.g, in prostate or testicular 
cancer. 

Detection Of Prostate-Specific Or Testis-Specific Gene Expression 

As noted, the antibodies described above can be xised to monitor 

10 prostate-specific or testis-specific protein expression. In situ hybridization 
of RNA can be used to detect the expression of prostate-specific or testis- 
specific genes. RNA in situ hybridization techniques rely upon the 
hybridization of a» specifically labeled nucleic acid probe to the cellular 
RNA in individual cells or tissues. Therefore, RNA in situ hybridization is 

15 a powerful approach for studying tissue- and temporal-specific gene 

expression. In this method, oUgonucleotides, cloned DNA fi-agments, or 
antisense RNA transcripts of cloned DNA firagments corresponding to 
unique portions of prostate-specific or testis-specific genes are used to 
detect specific mRNA species, eg., in the tissues of animals, such as mice, 

20 at various developmental stages, or to monitor tumor progression. Other 
gene expression detection techniques are known to those of skill in the art 
and can be employed for detection of prostate-specific or testis-specific 
gene expression. 

25 Identification of Additional Prostate-Specific Or Testis-Specific Genes 
Standard techniques, such as the polymerase chain reaction (PGR) 
and DNA hybridization, as well as the SSH and other techniques described 
herein, can be used to clone prostate-specific or testis-specific homologues 
ia other species and other prostate-specific or testis-specific genes in 
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humans. Prostate-specific or testis-specific genes and homologues can be 
readily identified using low-stringency DNA hybridization or low- 
stringency PGR with human prostate-specific or testis-specific probes or 
primers. Degenerate primers encoding human prostate-specific or testis- 
specific or hxraian prostate-specific or testis-specific amino acid sequences 
can be used to clone additional prostate-specific or testis-specific genes and 
homologues by RT-PCR. 

Additional prostate-specific or testis-specific genes include genes 
expressed during various growth and developmental phases of the diseased 
prostate or testis, e.g., those involved in prostate cancer, benign prostatic 
hyperplasia, or testicular cancer, and genes expressed as a result of a drug 
regimen. 

Construction of Transgenic Animals and Knockout Animals 
15 r ■ ; Characterization of prostate-specific or testis-specific genes provides 
/. >' information that allows prostate-specific or testis-specific knockout animal 
models to be developed by homologous recombination. Preferably, a 
prostate-specific or testis-specific knockout animal is a mammal, most 
preferably a mouse. Similarly, animal models of prostate-specific or testis- 
20 specific overproduction can be generated by integrating one or more 

* prostate-specific or testis-specific sequences into the genome of an animal, 
according to standard transgenic techniques. Moreover, the effect of 
prostate-specific or testis-specific gene mutations {e.g., dominant gene 
mutations) can be studied using transgenic mice carrying mutated prostate- 
25 specific or testis-specific transgenes or by introducing such mutations into 
the endogenous prostate-specific or testis-specific gene, using standard 
homologous recombination techniques. 

A replacement-type targeting vector, which can be used to create a 
knockout model, can be constructed using an isogenic genomic clone, for 
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example, from a mouse strain such as 129/Sv (Stratagene Inc,, LaJolla, 
CA). The targeting vector can be introduced into a suitably-derived line of 
embryonic stem (ES) cells by electroporation to generate ES cell lines that 
carry a profoundly truncated form of a prostate-specific or testis-specific 
5 gene. To generate chimeric founder mice, the targeted cell lines are 

injected into a mouse blastula-stage embryo. Hetero2ygous offspring can 
be interbred to homozygosity. Prostate-specific or testis-specific knockout 
mice provide a tool for studying the role of prostate-specific or testis- 
specific polypeptides and nucleic acid molecules in embryonic 
10 development and in disease. Moreover, such mice provide the means, in 
vivo, for testing tiierapeutic compounds for ameUoration of diseases or 
conditions involving a prostate-specific or testis-specific polypeptide or 
nucleic acid molecule-dependent or prostate-specific or testis-specific 
polypeptide or nucleic acid molecule-affected pathway. 

15 

Animal Models 

The prostate-specific and testis-specific polypeptides, antisense 
compounds, etc., of the invention can also be used in conjunction with 
animal models of prostate or testis disorders, to test the therapeutic, 

20 diagnostic, and screening methods of the invention. An exemplary prostate 
cancer model in transgenic mice is called TRAMP, in which the S V40 large 
T antigen is targeted to the prostate (Greenberg et al., PNAS 92, 3439- 
3443, 1995). Another test system is the CWR22 (androgen-dependent) and 
CWR22R (androgen-independent) xenografts, as known in the art and as 

25 described herein. Growth, PSA secretion, metastasis, etc. of tiiese 

xenografts could be monitored in the presence and absence of the prostate- 
specific or testis-specific polypeptides, nucleic acid molecules, and other 
compounds of the invention. Other animal models, for example, animal 
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models of other forms of cancer, or inrnixinocomproinised animals, e.g., 
nude mice, may also be used. 

The following Examples will assist those skilled in the art to better 
5 understand the invention and its principles and advantages. It is intended 
that these Examples be illustrative of the invention and not limit the scope 
thereof. 

10 EXAMPLE 1 

Suppression Subtraction Of Prostate- And Testes-Specific Genes And 
Subcloning Into Pzero 

cDNA derived from poly(A)+ RNA of 10 different normal human 
tissues were subtracted against normal human prostate cDNA using 

15 siq)pression subtraction hybridization (SSH) (Diatchenko, L. et al., Proc. 
Natl Acad. ScL USA 93, 6025-6030, 1996) and the resulting cDNA 
fragments were cloned into an appropriate vector. SSH was performed as 
described (Clontech PCR-Select Cloning Kit) using prostate poly(A)+ RNA 
against a pool of poly(A)H- RNA obtained from ten normal human tissues 

20 (heart, brain, placenta, lung, liver, skeletal muscle, kidney, spleen, thymtis, 
and ovary). Upon secondary PGR amplification (12 cycles), the reactions 
were extracted with phenol/chloroform, the DNA with ethanol, and the 
pellets washed once with 70% ethanol. After drying, the DNA pellet was 
dissolved in 0.2XTE or MQ dHiO and cut with Rsal in a 20 ^il reaction for 

25 2 hrs at 3TC to excise adaptors. After digestion, the reactions were run on 
a 1.5% agarose gel, with molecular size markers on one side, at 5 V/cm, 40 
min. Care was taken not to expose the gel to short wavelength UV Ught. 
The adapter bands were excised, and the gel was run at 5 V/cm for 15 min 
in a reversed electric field to concentrate the cDNA bands. 
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The gel was visualized (long wave UV light) and the amplijSed 
cDNAs, ranging in size between 100 bp-lkB, were excised. The DNA was 
purified using the QAIBX gel DNA purification kit. The purified DNA 
was cloned into EcoRV-cut, dephosphorylated pZERO (Invitrogen). 
5 Ligation reactions were performed in 10 jil final volume in the presence of 
5% PEG, IX T4 Ligase buffer at 3TC overnight and a 1/5 dilution of 1^1 
of the ligation mix (PSL) was transformed into DHIOB electrocompetent 
cells (>10^** efficiency) or equivalent. Colonies were picked and the 
presence of cDNA inserts was confirmed. To that end, PCR was performed 
10 with T7 and SP6 primers directly from the colonies. 10% of the reactions 
were run on a 1.5% agarose gel to visualize amplified products. The 
colonies wilh inserts were grown and glycerol stocks (15%) were prepared 
and stored at -80°C. 



15 EXAMPLE 2 

Reverse Nortfaem Blot And Sequence Analyses 

To clone androgen-responsive genes represented in the PSL, the " 
reverse northern technique was used (Hedrick, S.M. et al., Nature 308, 149- 
153, 1984; Sakaguchi,N.etal.,£MB075: 2139-2147, 1986). In this 

20 procedxire, RNA made firom two populations of cells that are to be 

compared is used to make cDNA probes that are then hybridized to two 
identical arrays of clones. To that end, PSL clones were amplified by PGR 
and spotted on nylon filters in 96-welI format to generate two identical 
blots for each set of 92 clones (the remaining four spots were used for 

25 positive and negative controls). To make the probes, the androgen- 
responsive prostate cancer cell line LNCaP was used (Horoszewicz, J.S. et 
al., Cancer Res. 43, 1809-1818, 1983) and was either left untreated (the (-) 
probe) or treated with the synthetic androgen R1881 for 24 hours (the (+) 
probe). Poly(A)4- RNA was isolated firom these cells and was xxsed to make 
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the P-labeled probes. After hybridization with the (-) and (+) probes, 
clones that showed differential hybridization were selected for further 
analysis, i.e., confirmation by a secondary reverse northern blot, and 
northern blotting. 

5 Reverse northern screening on the cDNA clones was done essentially 

as described previously (Hedrick, S.M. et al., supra\ Sakaguchi, N. et al., 
supra) with some modifications. DNA (approximately 400 ng) from PGR 
amplification in step 6 was diluted in 200 \i\ of 0.4M NaOH, 10 mM EDTA 
and mixed well by pipetting. After incubation at 95°C for 5-10 minutes, 

10 the tubes were chilled on ice. Denatured DNA was blotted on two separate 
pieces of Zeta Probe GT+ membrane (Bio-Rad) using a dot-blot apparatus 
(Bio-Rad), Positive (Prostate specific antigen (PSA) cDNA) and negative 
(glyceraldehyde 3-phosphate dehydrogenase (G3PDH) cDNA) controls 
were included on each blot (bottom right) in duplicate. The membranes 

15 were rinsed with 2XSSC, air dried, and then baked at 80°C for 30 minutes. 
An exemplary reverse northern analysis is shown in Figure 1 . Note that 
there was a substantial uicrease in PSA hybridization in the (+) blot (probe 
prepared from cells that have been stimulated by androgens) compared with 
the (-) blot (probe prepared from unstimulated cells), whereas there was no 

20 significant change in hybridization of G3PDH between the two blots. 
Arrowheads indicate the positive clones identified in this experiment. 

To verify the tissue-specific nature of the isolated sequences, 
positive clones were tested in a standard northern blot against RNA 
preparations of multiple non-prostate tissue samples. Figure 2 shows a 

25 multiple tissue northern blot using NKX3 A as a probe, to show an 

exemplary tissue e?q)ression pattern seen in the positive clones. Lanes 1- 
10, and 12-16 are RNA preparations from non-prostate tissues, lane 11 is a 
RNA preparation from prostate, lane 12 is a RNA preparation from testis. 
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Twelve clones with no significant homology to known sequences (by 
BLAST analysis) were isolated from prostate tissue and LNCaP cells. SEQ 
ID NOs: 1-9 were identified as androgen-responsive differentially- 
expressed genes in the prostate, while SEQ ID NOs: 10-12 were identified 
5 as androgen-responsive differentially-expressed genes in LNCaP cells. 

EXAMPLE 3 

Isolation And Characterization Of The STMPl Gene And mRNA 

A normal prostate cDNA hbrary was screened by 5'- and 3'-RACE 
10 analysis, and resulted in the full-length cDNA for L74. Since computer- 
aided secondary structure prediction of the deduced amino acid sequence of 
L74 suggested the presence of a six-transmembrane domain in its C- 
terminal half, L74 was renamed Six-Transmembrane Protein of Prostate 1 
(STMPl). 

15 When the full-length STMPl cDNA was used in BLAST analysis, it , 
was found to match a BAG clone (GenBank accession # AC002064) except 
for a 3 1 3 bp repetitive unit in the 3 ' UTR region, thereby identifying it as 
the STMPl gene and localizing it to Chr7q21, The repetitive region is likely 
to be a cloning or sequencing artifact of the BAG clone. Gomputational 

20 exon/intron junction analysis and alignment of flie full-length cDNA 

sequence with the BAG clone revealed that STMPl gene is composed of six 
exons and five introns (Figure 4A). The transcription start site, the location 
and size of the exons and introns, and the location of the partial cDNA 
clone L74 (black box) are indicated. The start (atg) and stop codons (tga), 

25 as well as the putative polyadenylation signal (pA) are also indicated. The 
first two exons are short, non-coding exons of 83 and 61 bp, whereas exons 
3-6 encode the open reading frame (ORF) and are 525, 528, 165, and 3281 
bp long, respectively (Figure 4G). The STMPl gene spans around 26 kb, 
which is in part due to the extremely large size of intron 2 (12713 bp). 
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There are three different predicted promoters within 4 kb upstream of the 
STMPl initiation codon, none of which has any significant TATA or 
CAAT box consensus sequences, suggesting that STMPl is transcribed 
from a TATA-less promoter. 
5 The STMPl cDNA (GenBank accession # AY008445) has a predicted 
5' untranslated region (5'UTR) of approximately 1 kb (deduced by RACE 
analysis) and an unusually long 3 'UTR of approximately 4 kb that 
comprises --77% of the total cDNA sequence. The ORF starts within the 
3^ exon and is predicted to encode a 490 amino-acid protein (Figure 4B). 

1 0 A searcb for protein motifs identified six predicted transmembrane domains 
in the C-terminal half of STMPl starting at F209 (Figures 4B and 4E). 
Only the cDNA sequence surrounding the ORF is indicated. The exon- 
intron junctions are indicated and the location of the predicted 
transmembrane domains are highlighted (TM 1-6) (Figure 4B). The stop 

15 codon is indicated with an asterisk. STMPl has two alternatively spliced 
forms, shown in Figures 4F-4K, which lead to two predicted isoforms of 
the protein. 

EXAMPLE 4 

20 STMPl Belongs To A New Subfamilv Of Six-Transmembrane Domain 
Proteins 

BLAST analysis of GenBank with the predicted STMPl amino acid 
sequence identified two independent ESTs and STEAP, a recently 
discovered cell membrane protein enriched in prostate for expression. An 
25 ahgnment of these sequences, obtained by Clustal and GenDoc programs, is 
shown in Figure 5. Completely conserved residues are shaded in black; 
residues that are conserved in two or three of the sequences are shaded light 
and dark gray, respectively. This alignment suggested that while the EST 
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BAA91839 cDNA may be close to full-length, BAB15559 cDNA may 

represent a partial sequence. 

The sequences of two proteins related to STMPl were determined 

(Figures 4L and 4M, STMP2 and STMP3, respectively). The STMP2 and 
5 STMP3 sequences contain the EST sequences. The GFP-fusion of STMP2 

gives similar localization as STMPl. Both STMP2 and STMP3 are more 

widely distributed and have higher levels in some tissues other than the 

prostate. For example, STMP2 has the highest expression in the placenta 

and the lung, and is also highly expressed in the heart, liver, prostate, and 
10 testis, while STMP3 has the highest expression in the liver, and is also 

highly expressed in the heart, placenta, lung, kidney, pancreas, prostate, 

testis, small intestine, and colon. 

The sequence similarity between STMPl and STEAP is limited and 

not significant before residue 210 of STMPl where the predicted six- 
15 transmembrane coding domain starts. This suggests that the N-terminal 

region is structurally and functionally related among STMP proteins, 

forming a six-transmembrane protein subfamily that is distinct from 

STEAP. 

20 EXAMPLES 

STMPl Expression Is Highlv Enriched In Prostate 

The expression profile of STMPl was then deterauned in various 
human tissues by Norfhem analysis, in which a multiple tissue Northem 
blot was hybridized to the STMPl probe (see Materials and Methods). As 

25 shown in Figure 6A, STMPl hybridized to a major mRNA species of 6.5 
kb, and three minor mRNA species of 2.2, 4.0, and 4.5 kb in the prostate 
tissue. The stronger hybridization that is observed with G3PDH in the 
heart and skeletal muscle samples is due to its higher expression in these 
tissues. The lanes represent: LHeart, 2. Brain, 3. Placenta, 4. Limg, 5. 
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Liver, 6. Skeletal Muscle, 7. Kidney, 8. Pancreas, 9. Spleen, 10. Thymus, 
11. Prostate, 12. Testis, 13. Ovary, 14. Small Intestine, 15. Colon, 16. 
Peripheral blood leukocyte. The location of the full-length 6.5 kb niRNA, 
as well as the lower molecular weight STMPl species are indicated by 
5 arrows to the left of the figure. There was 15-20-fold lower mRNA 
expression of the 6.5 kb band in the heart, brain, kidney, pancreas, and 
ovary, compared to prostate, and no detectable expression in other tissues. 
In contrast, the three lower molecular weight species, encoded by 
alternatively spliced forms of STMPl, were only detectable in the prostate. 

10 Hybridization with a glyceraldehyde 3-phosphate dehydrogenase (G3PDH) 
cDNA probe resulted in approximately similar signals in all lanes, except 
for the heart and skeletal muscle where G3PDH is known to be more 
abimdant compared with other tissues. These data show that STMPl 
expression is high in the prostate, although expression can be seen in other 

1 5 tissues, and that STMPl has isoforms that are restricted to the prostate for 
expression. 

EXAMPLE 6 
Characterization Of STMPl Expression 

20 Since androgen is a major hormonal stimulus for the normal prostate 

gland and for early stage prostate cancer, the possible androgen regulation 
of STMPl was assessed by Norfhem analysis in the androgen-responsive 
prostate cancer cell line LNCaP. Cells were either left untreated or treated 
with the synthetic androgen R1881 (10"^ M) with increasing amounts of 

25 time (hours) as indicated (Figure 6B), harvested, and total RNA isolated 
and used in Northern analysis with STMPl cDNA as probe. The same 
membrane was also probed for the androgen-dependent gene PSA. 
Relative induction of mRNA accumulation is indicated at the bottom of the 
lanes, as determined by phosphorimager analysis (Molecular Dynamics). 
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The CWR22 xenograft was grown in nude mice and tumor samples were 
collected either before (t=0) or 1, 2, or 4 weeks after castration. Total RNA 
was isolated and was then used in Northern analysis with the same probes. 
Ethidium bromide-stained 18S RNA is shown as a control for RNA 
5 integrity and loading. At 6 h, there was an approximately 25% increase in 
STMPl expression, which was lost by 24 h, with a final 20% decrease 
observed at 48 h compared with basal levels. In contrast, the mRNA 
accumulation of the androgen-regulated gene PSA dramatically increased 
upon androgen stimulation in a time-dependent manner, as expected, 

10 reaching approximately 22-fold higher levels by 48 hours. Relative 

induction of STMPl ixiRNA accumulation is indicated at the bottom of the 
lanes determined by phosphorimager analysis. As is shown in Figure 6B, 
STMPl displayed similar expression levels in untreated and R1881-treated 
LNCaP cells, indicating that STMPl expression is not significantly 

1 5 regulated by androgens in LNGaP cells. 

To determine the possible androgenic regulation of STMPl expression 
in an in vivo setting, the androgen-dependent xenograft model CWR22, 
which is derived firom a primary human prostate tumor was used 
(Wainstein, M. A. et al., Cancer Res 54, 6049-6052, 1994). Since they are 

20 androgen-dependent for growth, the CWR22 tumors in nude mice display 
marked regression upon castration and may regress completely. CrWR22 
xenografts were grown ia nude mice in the presence of a sustained release 
testosterone pellet After the tumors had grown, the mice were castrated, 
the testosterone pellets were removed, and the regressing tumors were 

25 collected at 1, 2, or 4 weeks post-castration. Total RNA was prepared from 
these tumor samples and used in Northern analysis. As shown in Figure 
6B, similar to the obsevations in LNCaP cells, STMPl mRNA 
accumulation in the CWR22 tumors showed no significant change upon 
castration and was not affected by the presence of androgens (note that 
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there is underloading of RNA for CWR22 2wk sample). In contrast, the 
mRNA accumulation of the androgen-regulated gene PSA was dramatically 
decreased upon castration, dropping to approximately 16% of pre-castrate 
levels by two weeks post-castration. These results are consistent with the 
5 findings in LNCaP cells and suggest that STMPl expression is not 
significantly regulated by androgens in prostate cancer cells. STMPl 
expression was substantially lower in the CWR22 tumors compared with 
LNCaP cells. 

The expression profile of STMPl was also analyzed in the androgen- 
ic independent prostate cancer cell lines PCS and DU145, as well as in four 
independent, relapsed derivatives of CWR22 tumors, named CWR22R 
(Nagabhushan, M. et al.. Cancer Res 56, 3042-3046, 1996), representative 
of advanced prostate cancer (Figure 6C). LNCaP (in the presence (+) or 
absence (-) of R1881 (10"^ M)), PC-3, or DU-145 cells were grown and 
15 total RNA was isolated. Four independent lines of the androgen 

independent human prostate cancer xenograft CWR22R, were grown in . 
nude mice, tumors were collected, and total RNA was isolated and used in ^ 
Northem analysis with STMPl or the androgen target gene NKX3J cDNAs 
as probes. Ethidium bromide-stained 18S RNA is shown as a control for 
20 RNA integrity and loading. The relative induction of STMPl and NKX3, 1 
mRNA accmnulation is indicated at the bottom of the lanes determined by 
phosphorimager analysis (Molecular Dynamics). As is shown in Figure 
6C, STMPl expression was high in LNCaP cells and did not significantly 
change in response to R1881 treatment compared with a ~9-fold induction 
25 of the androgen target gene NKX3,L There was no STMPl expression in 
the androgen-independent prostate cancer cell lines PC-3 or DU-145, as 
was the case for NKX3.L In contrast, there was significant STMPl 
expression in tumors firom all four independent CWR22R xenograft lines 
tested, ranging between -30-60% of that observed in LNCaP cells. A 
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similar overexpression pattern was also observed for NKX3.1 (Figure 6Q 
consistent with previous findings (Korkmaz, K. S. et al, Gene 260, 25-36, 
2000). 

An interesting property of STMPl expression profile is that even 
5 though it is e;q)ressed at low levels in the androgen dependent CWR22 
xenograft, it is highly expressed in the relapsed CWR22R which is 
androgen receptor (AR) positive, but is not responsive to androgens. This 
indicates that STMPl expression is deregulated once the prostate tumor 
progresses firom an androgen-dependent to an androgen-indep^dent phase. 
10 In addition, STMPl is not expressed in the AR-negative prostate cancer cell 
lines PC-3 and DU-145, but is expressed at high levels in the AR-positive 
ceU line LNCaP and the CWR22 and CrV^2Rxenogra^^ Thus, 
expression of STMPl is correlated with the presence of a functional AR in 
the cell. 

15 It has been known for over 50 years that androgens play a key role 

both in the development and maintenance of the normal prostate and the 
initiation and progression of prostate cancer. Androgen withdrawal results 
in involution of both the normal prostate gland as well as a prostate tumor 
in the early stages of the disease that is still androgen dependent. 

20 Consequently, androgen withdrawal is commonly used as treatment to 
reverse tumor growth. However, in the case of the prostate tumor, after a 
few months or years, the tumor recurs in ahnost all cases in an androgen- 
independent state. At this point there is no effective therapy and prognosis 
for survival is extremely poor. Since STMPl is overe?q)ressed during this 

25 later androgen-insensitive state, it will be a usefiil tool in diagnostic and 
therapeutic appUcations for prostate cancer. 

These data indicate that STMPl expression is deregulated once 
prostate cancer progresses from an androgen-dependent to an androgen- 
independent state. 
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EXAMPLE? 
Intracellular Localization Of STMPl 

To gain insight into the intracellular localization pattern of STMPl, a 
5 green fluorescent protein (GFP)-STMP1 fusion protein was generated. The 
use of such GFP chimeric proteins has recently become a standard method 
to assess intracellialar localization and dynamics of proteins, COS-1 cells 
were transiently transfected with GFP-STMPl, fixed and processed for 
confocal microscopy as described in Materials and Methods. 

10 A series of 11 confocal sections along the z-axis were collected 

tiirough a single cell at nominal 100 nm intervals. Three of the consecutive 
sections and the projection of all 1 1 sections are shown in Figure 7A. 
Arrows indicate tubular-vesicular structures (VTS) in different sizes, 
shapes, and locations CBar=5|jm). In all 1 1 z-plane sections, GFP-STMPl 

1 5 showed bright jxixtanuclear distribution pattern, characteristic of the Golgi 
complex. Additionally, GFP-STMP l was dispersed in spots of variable size 
throughout the cytoplasm and "at the cell periphery (z-7, projection). Some 
of these bright fluorescent spots were tubular (z-6, arrow and Figure 8) or 
vesicular (z-5, arrow) in morphology. 

20 To determine more directly whether GFP-STMPl was localized to 

the Golgi complex, we compared its intracellular distribution with those of 
two well characterized Golgi markers, the medial Golgi enzyme 
mannosidase H (Manll) (Rabouille, C. et al., J Cell Sci 108, 1617-1627, 
1995) and the coat protein P-COP (Pepperkok, R. et al. Cell 74, 71-82, 

25 1993). COS-1 cells were transfected with GFP-STMPl, fixed, labeled with 
the appropriate primary and secondary antibodies and imaged by confocal 
laser scanning microscopy. Green GFP-STMPl fluorescence and red 
(Texas Red-labeled secondary antisera) p-COP and Manll fluorescence 
were detected by confocal laser microscopy. Panels to the right show the 
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overlay images with yellow/orange staining indicating the regions of 
colocalization. Bars=5|am. As shown in Figure 7B, the distribution of 
GFP-STMPl extended throughout the Golgi complex, as evidenced by 
significant colocalization with both Manll and p-COP. However, some 
5 areas of non-overlap between the GFP-STMPl and both Golgi markers 
were observed suggesting that STMPl, at least in part, is differentially 
localized within the Golgi complex compared with these two markers. 

Since GFP-STMPl was associated with VTS (Figure 7A and Figure 8), 
more specific localization of GFP-STMPl to the trans-Golgi network 

10 (TGN), an important site for the sorting of proteins destined to the plasma 
membrane, secretory vesicles, or lysosomes (Farquhar, M. G, & Palade, G. 
E. Trends Cell Biol 8, 2-10, 1998; MeUman, L & Warren, G., Cell 100, 99- 
112, 2000; Lenraion, S, K & traub, L, M., Curr Opin Cell Biol 12, 457- 
466, 2000) was assessed. An antibody against TGN46, a TGN resident 

15 protein that shuttles between the TGN and the plasma membrane (Prescott 
AR, et al., Eur J Cell Biol 72, 238-246, 1997; Ponnambalam, S. et al., J * 
Cell Sci. 109, 675-685, 1996), was used in immunoflourescence 
microscopy experiments as above. As shown m Figure 7B, GFP-STMPl 
extensively colocalized with TGN46, greater than that observed with ManU 

20 and p-COP, suggesting that in the Golgi complex, STMPl is primarily 
localized to .the TGN. Note that the images with TGN46 were obtained 
with lower objective power. 

EXAMPLE 8 

25 STMP 1 Shuttles Between The Golgi And The Plasma Membrane And 
Colocalizes To The Early Endosomes 

The dynamic properties and intracellular trafficking of GFP-STMPl 
were studied using confocal time-lapse imaging in living cells. COS-1 cells 
were transiently transfected with GFP-STMPl and, 16 h after transfection, 
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12 consecutive images were collected from live cells every 20s at 37*^0 by 
confocal laser scaiming microscope (Figure 8). The upper panel shows a 
VTS extending out and retracting back to the Golgi body (white arrows). 
In the middle panel and the &st image in the lower panel (160s), red 
5 arrows indicate the translocation of a VTS from the Golgi body to the cell 
periphery. In the lower panel, yellow arrows point to the movement of a 
VTS from the edge of the cell towards the Golgi body. Note that tiie results 
shown are representative of multiple time-lapse analyses and the changes in 
the images are not due to movement from the plain of focus. Bar=5|am. 

10 As shown in Figure 8, some VTS were found to be detaching and some 

to be associating with the Golgi complex. The VTS were hi^y dynamic 
and pleiomorphic in size. Some of the VTS followed straight or curvilinear 
paths, some moved in a stop-and-go fashion, and some showed saltatory 
movements. The VTS indicated at the top panel (white arrows) extended 

1 5 away from and then retracted back to the Golgi. The VTS in the middle 
panel and the first image in the lower panel (red arrows) detached from the 
Golgi complex, paused, and then moved towards the cell periphery until it 
disappeared at the cell edge suggesting that STMPl is associated with the 
secretory pathway. The VTS in the lower panel (yellow arrow) moved 

20 from the cell periphery towards the Golgi body suggesting that STMPl is 
localized to the endocytic pathway. 

EXAMPLE 9 

Colocalization Of GFP-STMPl With The Earlv Endosomal Marker EEAl 
25 To probe whether GFP-STMPl was associated with the endocytic 

pathway, the intracellular distribution of GFP-STMPl was compared with 
that of the early endosome protein EEAl (Stenmark, H. et al., J Biol Chem 
271, 204048-204054, 1996). COS-1 cells were transfected with GFP- 
STMPl, fixed, immunostained with EEAl antibodies and observed by 
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confocal laser scanning microscopy. Green GFP-STMPl fluorescence and 
red (Texas Red-labeled secondary antiserum) EEAl fluorescence were 
detected by confocal laser microscopy. The panel to the right shows the 
overlay images with yellow/orange staining indicating the regions of 
5 colocalization. Arrows indicate examples of the VTS in the cell periphery 
which contain both EEAl and STMPl. Bar=5|um. As shown in Figure 9, 
EEAl manifested a similar intracellular distribution in both transfected and 
untransfected cells. Fxirthermore, GFP-STMPl significantly colocalized 
with EEAl both in the cell periphery and also in the perinuclear area 
10 (Figure 9, arrows) suggesting that STMPl is associated with early 
endosomes and the endocytic pathway. 

EXAMPLE 10 

Isolation And Characterization of the SSH9 Gene And mRNA 
15 The SSH9 gene was identified and mapped (Figure 10). The 

predicted promoter site,* the transcription start site, and the location and size 
of the exons and introns are indicated. The start and stop codons, as well as 
two polyadenylation signals, leading to two alternatively spliced transcripts, 
are also indicated. Figures 1 1 A-C show the nucleotide and predicted amino 
20 acid sequence of SSH9, as well as the predicted promoter sequence and 
exon-intron boundaries. 

The expression profile of SSH9, determined in various human tissues 
by Northern analysis (Figure 12C), revealed that the 0.7 kb spUce variant of 
SSH9 was highly testis-specific, while the 1.4 kb transcript was expressed 
25 in both prostate and testis. 

The androgen regulation of SSH9 was examined in LNCaP ceUs and 
in CWR22 xenografts (Figure 12A) revealed that SSH9 is not regulated in 
LNCaP cells, but is regulated ia CWR22 xenografts. The expression 
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profile of SSH9 was also examined in the androgen-independent prostate 
caacer cell lines PCS and DU145, and in CWR22R cells (Figure 12B). 

EXAMPLE 11 

5 Isolation And Characterization of the PSL22 Gene And mRNA 

The PSL22 gene was identified and mapped (Figure 13). The 
location and size of the exons and introns, the location of the partial cDNA 
clone (black box), as well as the alignment of the fidUength cDNA clone 
with GenBank Accession Nos. AC00855 1 and ACOl 1449, are indicated. 
10 Figures 14A-C show the nucleotide sequence of the ORF, cDNA and 

predicted amino acid sequence, as well as the predicted promoter, exon, and 
UTR sequences of PSL22. 

BLAST analysis of GenBank with the predicted PSL22 amino acid 
sequence identified PSL22 as a Rho binding protein. Figure 15 shows a 
1 5 multiple sequence alignment of PSL2S with related proteins. Completely 
^ conserved residues are shown in black; residues foxmd in three sequences 
are shaded. 

The expression profile of PSL22, determined in varioxis human 
tissues by Norfhem analysis (Figure 1 16B), revealed that while the highest 
20 expression was seen in the prostate, high expression was seen in the kidney, 
pancreas, and colon. 

The androgen regulation of PSL22 was examined in LNCaP cells, in 
the androgen-independent prostate cancer cell lines PC3 and DU145, and in 
CWR22R cells (Figure 16A). The results showed that PSL22 is androgen 
25 regulated in LNCaP cells, where it is highly expressed, but is not androgen 
regulated in the PCS and DU145 cells. 
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EXAMPLE 12 

Materials And Methods 

The following materials and methods were used in performing the 
exemplary experiments shown herein. It is understood that these materials 
5 and methods are subject to modifications that do not change the nature of 
the invention, as will be understood by those of ordinary skill in the art. 

Probes 

Poly (A)+ RNA 1 ng [(-) or (+)] 

10 Random primer (N7) 200 ng 

RNAse-free sterile H2O to 20 ul 
Heat at 70°C for 10 min, and chill on ice. 

. While heating the RNA samples, the following solution was 



prepared: 

15 5X 1st strand buffer 10 ul 

O.lmMDTT 5 \il 

lOmMeachdTTP+dGTP 2\il 
^¥ alpha dATP 5 ^l 

^^P alpha dCTP 5^1 



20 Superscript n (200 U /)il, BRL) 2^1 

The solution was mixed by pipetting, spim briefly, incubated at 25°C 
for 5 min, and then for an additional 1 hour at 37°C. 2 ^1 of lOmM dCTP + 
dATP was added and the mixture was incubated for 30 min at 37°C and 
then heat inactivated at 70°C for 10 min. Unincorporated nucleotides were 

25 removed using prespxm G25 columns (Bio-Rad). Specific activity (which 
should be over 5xl0^cpm/^g) was calculated. 
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Hybridization 

Freshly prepared 25 ml Hybridization mix (7% SDS, 0.5 M 
NaHP04, ImM EDTA) was pre-warmed at 65°C and 12.5 ml was used for 
prehybridization of each membrane, 5-10 min at 65 "^C. The probe was 
5 heat denatured at 95°C for 3-5 min and transferred to the prehybridization 
mix at 65*'C. Hybridization was carried out at 65*^0 overnight. 

Washing 

Wash solution I (2xSSC and 1% SDS) and H (O.lxSSC and 0.5% 
1 0 SDS) were prewarmed, and the membrane were washed once with Solution 
I and then with Solution n for 30 min at 65°C. The membranes were 
covered with plastic wrap and exposed to a phoshorimiager screen. 

Selection 

1 5 Clones that showed differences between the (-) and (+) blots were 

picked (usually 1-8 on each blot pair). A secondary round of reverse 
norfhem analysis for confirmation was performed, this time spotting each 
clone in duplicate on each blot. After phosphorimager analysis, the blots 
were stripped in O.lxSSC and 0.5% SDS for 2x15 min at 95°C and 

20 hybridized with a PSA probe (or depending on the hormone that is being 
.used, with a probe for any abundant target genes in the tissue under study). 
For the clones that were confirmed to be different from PSA, for 
differential expression in the secondary reverse northern, norfhem analysis 
was performed using established protocols. A time course of R1881 

25 mduction of LNCaP cells, as well as the CWR22 xenograft model upon 
androgen ablation (Wainstein, M. A. et al., Cancer Res. 54, 6049-6052, 
1994) and the androgen-independent CWR22R relapsed xenograft 
(Nagabhushan, M. et al., Cancer Res. 56, 3042-6, 1996), was used. 
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Sequence analysis 

Sequence analysis was perfomed by the dideoxy chain termination 
methods using an ABI automated sequencer. Homology search was done 
using a basic BLAST algorithm. Figure 3 shows a table of results obtained 
5 from the BLAST analysis of isolated clones and their homology to known 
genes (The cutoff for significant homology was 50% identity). 

Isolation of prostate cancer related genes from LNCaP cells 

The prostate cancer cell line LNCaP was cultured in two batches in 

10 culture conditions similar to those previously described (Horoszewicz JS et 
al.. Cancer Res, 43: 1809-1818, 1983). The first batch was left untreated, 
while the second batch was treated with the synthetic androgen R1881 for 
24 hrs. Cells from both batches were harvested and total RNA was then 
isolated from each batch. From the total RNA, polyA"^ RNA was obtained 

15 using standard procedures, and was used in die Suppression Subtraction 
Hybridization (SSH; Diatchenko et al., stpra) procedure to identify hor- 
mone regulated genes. The tester in the SSH procedure was cDNA from 
untreated cells and the driver was cDNA from Rl 88 1 -treated cells. The 
suppression subtraction protocol was performed according to the original 

20 description of the method (Diatchenko et al., supra). 

Cell culture 

LNCaP, PC-3 and DU-145 cells were routinely maintained and 
treated as described previously (Korkmaz, K. S. et al., DNA Cell Biol 19, 
25 499-506, 2000; Korkmaz, K. S. et al., Gene 260, 25-36, 2000). 
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Xenograft studies 

Transplantatioa, growth, and harvesting of tumors from mice 
bearing the CWR22 and CWE22R xenografts were as previously described 
(Wainstein, M. A., supra\ Nagabhushan, M., supra), 

5 

Cloning and plasmid construction 

A 262 bp cDNA fragment was originally obtained from a screen of a 
prostate specific library (Ausubel, F. M., et al. (1997) Current Protocols in 
Molecular Biology (John Wiley and Sons, New York) and termed L74. 5' 

10 Rapid Amplification of cDNA Ends (RACE) was performed 

(oligonucleotide sequences available upon request) using the Marathon- 
Ready cDNA that was prepared from normal prostate tissue (Clontech) 
and/or SMART-RACE LNCaP cDNA library (Clontech) that was 
generated according to the manufacturer's recommendations. RACE 

15 products were cloned into pCRII-TOPO (Invitrogen), positive clones were 
confirmed by Southern analysis, and sequenced. In parallel, a XgtlO cDNA 
library made from a pool of normal human prostates (Clontech) was 
screened by established procedures to obtain additional clones. Overlapping 
clones were used to deduce the full-length STMPl cDNA sequence. 

20 The full-length STMPl ORP was amplified by using primers centered 

around the start and stop codons (sequences available upon request) and 
fused in frame to the C-terminus of green flourescent protein (GFP) using 
the vector pcDNA3.1-NT-GFP-T0P0 (Invitrogen) to generate GFP- 
SlMPl. 

25 

Northern analysis 

Total RNA was prepared by the single step guanidine thiocyanate 
procedure and used in Northern analysis (18). 15 |Jig of total RNA was used 
per lane. Probes were generated by random priming and had a specific 
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activity of >3xlO^ dpm/|ag. A cDNA fragment of STMPl spaiming residues 
145-2202 bp was used as probe. Bauds were visualized and qxiantitated by 
phosphorimager analysis (Molecular Dynamics). 

5 Confocal microscopy 

COS-1 cells were transiently transfected by electroporation using a 
BTX square-wave pulser at 150 V, 1 ms duration. Cells were grown eitber 
on cover slips placed in 6-well tissue culture plates for indirect 
immunofluorescence or on Lab-Tek Chambered Coverglass (Nalge Nunc 
1 0 International) for live-cell microscopy. Transiently transfected cells were 
observed 16 h after transfection by Leica TCS-SP confocal microscope. All 
live-cell experiments were done at 37°C. 

Indirect immunofluorescence 

15 The indirect immxmofluorescence was carried out as previously 

described (Misteli, T. & Spector, D. L. Mol Cell 3, 697-705, 1999). The 
following antibodies were used; anti-P-coat protein (P-COP) antiserum 
(kindly provided by J. Lippincott-Schwartz), anti-mannosidase n (kindly 
provided by T. Misteli), anti-TGN46 (Serotec, kindly provided by J.S. 

20 Bonifacino), and anti-EEAl (Affinity Biotechnologies). Texas Red- 
conjugated secondary antibodies specific for mouse and rabbit were 
purchased from ICN Biomedicals (Costa Mesa, CA). 

Other Embodiments 
25 All publications and patent appUcations mentioned in this 

specification are herein incorporated by reference to the same extent as if 
each independent publication or patent application was specifically and 
individually indicated to be incorporated by reference. 
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While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of further 
modifications and this appHcation is intended to cover any variations, uses, 
or adaptations of the invention following, in general, the principles of the 
5 invention and including such departures firom the present disclosure that 
come within known or customary practice within the art to which the 
invention pertains and may be appUed to the essential features hereinbefore 
set forth, and follow in the scope of the appended claims. 
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CLAIMS 

1 . A substantially pure prostate-specific or testis-specific polypeptide, 
5 said polypeptide sequence comprising a sequence substantially identical to 

the sequence of any of SEQ ID NOS: 14, 29, 32, 34, 36, 41, or 53. 

2. The substantially pure prostate-specific or testis-specific polypeptide 
of claim 1, said polypeptide sequence comprising the sequence of any of 

10 SEQ ID NOS: 14, 29, 32, 34, 36, 41, or 53. 

3 . An isolated nucleic acid molecule encoding a polypeptide of claim 1 . 

4. The isolated nucleic acid of claim 3, wherein said nucleic acid 

15 molecule comprises the sequence of any of SEQ ID NOS: 23, 28, 31, 33, 
35, 40, or 52. 

5 . An isolated prostate-specific or testis-specific nucleic acid molecule, 
said nucleic acid molecule comprising a sequence substantially identical to 

20 SEQ ID NOS: 1-12, 22, 27, 30, and 51. 

6. An isolated prostate-specific or testis-specific nucleic acid molecule, 
said nucleic acid molecule consisting essentially of SEQ ID NOS: 15-21, 
24-26, 42-50, and 54-70. 

25 

7. The polypeptide of claim 1, wherein said polypeptide is derived 
firom a mammal. 

8. The polypeptide of claim 6, wherein said mammal is a human. 
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9. A vector comprising the isolated nucleic acid molecule of claim 3, 5, 
or 6. 

5 10. A cell comprising the isolated nucleic acid molecule of claim 3, 5, or 
6. 

11. A cell comprising the vector of claim 9. 

10 12. A non-human transgenic animal comprising the isolated nucleic acid 
molecule of claim 3, 5, or 6. 

13. An isolated nucleic acid molecule that hybridizes under high 
stringency conditions to the complement of any of the sequences set forth 

15 in SEQ ID NOS: 1-12, 15-28, 30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, 
wherein said isolated nucleic acid molecule encodes a prostate- 
specific or testis-specific. polypeptide. 

14. An isolated nucleic acid molecide, wherein said nucleic acid 

20 molecule comprises a sequence that is antisense to the coding strand of any 
of the prostate-specific or testis-specific nucleic acid molecules set forth in 
SEQ ID NOS: 1-12, 15-28, 30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, or a 
fragment thereof 

25 
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15. A probe for analyzing a prostate-specific or testis-specific gene or 
homolog or fi-agment thereof, said probe having greater than 55% 
nucleotide sequence identity to a sequence encoding any of SEQ ID NOS: 
1-12, 15-28, 30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, or fragment thereof, 

5 wherein said fragment comprises at least six anuno acids, and said 

probe hybridizes under high stringency conditions to at least a portion of a 
prostate-specific or testis-specific nucleic acid molecule, 

16. The probe of claim 14, wherein said probe has 100% 

10 complementarity to a nucleic acid molecule encoding any of SEQ ID NOS: 
1-12, 15-28, 30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, or fragment thereof, 

wherein said fragment comprises at least six amino acids, and said 
probe hybridizes under high stringency conditions to at least a portion of a 
prostate-specific or testis-specific nucleic acid molecule. 

15 : 

17. An antibody that specifically binds to a prostate-specific or testis- 
specific polypeptide, said polypeptide comprising an amino acid sequence 
that is substantially identical to the amino acid sequence of any of SEQ ID 
NOS: 14, 29, 32, 34, 36, 41, or 53. 

20 

18. A method of detecting a prostate-specific or testis-specific gene or 
fragment thereof in a cell, said method comprising 

contacting the nucleic acid molecule of any of SEQ ID NOS: 1-12, 
15-28, 30, 31, 33, 35, 40, 42-50, 51, 52, or 54-70, or a fragment thereof, 
25 wherein said fragment is greater than about 18 nucleotides in length, with a 
preparation of genomic DNA from said cell, under high stringency 
hybridization conditions, and 
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detecting DNA sequences having about 55% or greater nucleotide 
sequence identity to any of SEQ ID NOS: 1-12, 15-28, 30, 3 1, 33, 35, 40, 
42-50, 51, 52, or 54-70, thereby identifying a prostate-specific or testis- 
specific gene or jfragment thereof. 

5 

19. A method for identifying a test compound that modulates the 
expression or activity of a prostate-specific or testis-specific polypeptide, 
said method comprising 

contacting said prostate-specific or testis-specific polypeptide with 
1 0 said test compound, and 

determining the effect of said test compound on said prostate- 
specific or testis-specific polypeptide expression or activity. 

20. A method of treating a mammal having a disorder of the prostate or 
1 5 testis, said method comprising 

administering to said mammal a therapeutically effective amouQt of 
a compound that modulates the activity or expression of a prostate-specific 
or testis-specific polypeptide, 

wherein said compoimd has a beneficial effect on said disorder in 
20 said mammal. 

2 1 . The method of claim 20, wherein said disorder is prostate cancer. 

22. A phannaceutical composition comprising at least one dose of a 
25 therapeutically effective amount of a prostate-specific or testis-specific 

polypeptide or fragment thereof, in apharmaceutically acceptable carrier, 
said composition being formulated for the treatment of a disorder of the 
prostate or testis. 
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23. The method of claim 20, wherein the prostate-specific or testis- 
specific polypeptide comprises an amino acid sequence substantially 
identical to the amino acid sequeace of SEQ ID NOS: 14, 29, 32, 34, 36, 
38, 39, 41, 53, or 71-73 and fragments and analogs thereof. 

5 

24. The method of claim 20, wherein said mammal is a human. 

25. A kit for the analysis of a prostate-specific or testis-specific nucleic 
acid molecule, said kit comprising a nucleic acid molecule probe for 

10 analyzing a prostate-specific or testis-specific nucleic acid molecule present 
in a test subject. 

26. A kit for the analysis of a prostate-specific or testis-specific 
polypeptide, said kit comprising an antibody for analyzing a prostate- 

15 specific or testis-specific polypeptide present in a test subject. 
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SEQ. ID 
NO. 


SEQ. 
NAME 


LENGTH 


SEQUENCE 


1 


PSL 22 


251 


ACTAATGTGAGGAAaCAAACATGTTCAGGCCTGAACATTTCCGGTGCTOACT 

CGGCcTTAAACGTTTGTGCCATAATGGAAAATATCTATCTATCTGTTCTCAA 

ATCCTGTTmCTCATAGTGTAAACTCACATTTGATGTGTTm 

AGTAACCAAGAAACCTCTAGGAATTAGGAAAAAAaGAACnTmTGAGGTG 

TGTTACTATACTGCTGTAAGTTATTTATTATATAAAGTATTGTAAATAGAAaT 

AGTGTTGAGATATGAAATATGGCrATTTTTAATGGTGACAATTATAGACT^ 

TAGkTCACTATTAAATTGGGGTTACCTATATCcAGT 


2 


PSL 
229A 


349 


ACACATCCATCATTGTGAAATCTCTTTTCCAACAAACGTCCT 

ACAATTCATrAAAaTCTTTGGGGACTAAGCTACGAACAAAGTTCAACTAAAC 

TACCTACTGACTTCAAAAGGAACATATACCCACCACGTGTGGTAGCTCATG 

ACTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGGAGGATCACXrrGAGCCC 

AGGAGTTCCAGACCAGCCTAAGCAACATGCCAAGACCCTGTATGT 


3 


PSL 
E15C 


51 


ACAAAGACACCCTTGTYCCCCGGGCAAGGTCXnrCCAGCTACAAGGGGGCCA 


4 


PSL 
El 56 


148 


CCXYACATTGTCACAGAGAGGCTCCAGGCTTAAAGTTGACXnXSCGTAGAAA 

(tPA AfiA ATOA ATTnTTfrffAfifTA AfrTA ArTfTAnnnPfrATTrJA ATA A An APTTT 
TAGCAGCTGGGCCAGCrrGAACCATCCX:AACCCTTCAAATCCCCTTGT 


5 


PSL 
El 57 


261 


ACCCTAACTGAACCCATTTCAGCCACTCAGATTGATAGGGTGGAAAAGACA 

nr? rrP AnrfTfrGTA fir AGrroTfr a a fi a a a a n a fifi a a a ftP a CtA a fififiTfrfirTT 

ATAATCTACAGGCATGTAGAGAGGACrACATAGGOn'CTGTrCTTTG^^ 

AGGAGCCCCCTTCCTGTCCCTTGGACTCAGAATGGATCCTTCCAGCACACAT 

GGCCCAACACTGAGAGTGCAGGAAGCATGGGTAGGGGCCTCCTGCTGCTGG 

TATGT 


6 


PSL 
E391 


121 


AGTNTGNGGGGAimGAGGGCNGNTACGNNAAANGNTGGNCrACTOT 

TGCrGCTCGAGCGGCCGCCAGTGTGATGGATACAAGCrri'Ci"rrri"rri"rriT 

ATTTTCGNhrmTTTTTC 


7 


PSL 
K31 


93 


ACTCAGTAGGGACTGAGCACTAAATGCTTATTTTAAAAGAAATGTAAAGAG 
CAGAAAGCAATTCAGGCTACCCTGCCTTTTGTGCTGGCTAGT 


8 


PSL 
L28 


169 


ACACTTAAAATAGTTAATGTGATACATTTTATGTTACATGTATTT^ 
TGAAAAAATAAAAATATATAAACACACAGCAAATGATGACCAGGCCTTTGA 
A fi A A A nrrr AT A A A A r A A A ATT A A fi A A fiCrTfrfiTT A r A fi A ffPfr A fi A PTPTfi 

TCTCAAAAAAAAAAA 


9 


PSL 
L74 


262 


ACTTrACAAGCATGAAGGATATTAGGGTAAGTGGCTAATTATAAATCTACT 
CTAGAGACATATAATCATACAGATTATTCATAAAATTmCAGTGCTGTCCT 
TrrAPATTTA ATTfirATTTTfirTrA A AnvrTAfi A ATfirprTArATTrrprrr 

ACCCCAATTTGCTATTTOiriTATTAAAATAGAAAATTATA^^ 

TTATATGCGTTCCTCITCCTGAAATTATAACATTTCTAAACITACCCACGTAG 

GT 


10 


PSL 
SSH20 


175 


ACAGGTTGGCCCTTCACCTAGTTGACTCAGCCCTCGATAGTCTAGAGCCCAC 
CCCCTCCTCAGGAACTCAAGAGCTCAGCATTTATAATGAGCAGTTGGTAAT 
GAGTTGCCCTATGTGCTTGTCGCAAGCAGTCACAGAGATGAGCCCTATTACT 
TfiATATTPAfifiA APA A AfifiT 


11 


PSL 
SSH4 


331 


acatccaagccttcctctgcgtgagagcaaaggctttgctcatcagccagcc 

agtcttgttactatctggctactititaaggttaaaaaat;^^ 

critgctctgcaggcggcaaggcaggaggcgcaggcctctrcattgttcac 

atgtcacaggaggaggctctgagcaaaggccactggcaagttagggcaac 

accaagaaggctctgcggagagactccctgtgggttgggggsctggcagga 

ACGGTGCcTGTGGACTGTTTATGGTCTGTCCAGTTGAGGCTTGGTAAACCCA 
AGTAAAGTGTTAAAAACCTCAGT 


12 


PSL 
SSH9 


170 


acgactcatccacctccggctgaagctccaggagctgaaggaccccaatga 
ggatgagccaaacatccgagtgctccitgagcaccx3citrtacaaggagaa 
gagcaagagcgtcaagcagacctgtgacaagtgtaacaccatcatctgggq 
gctcattcagacctggt 



Figure 3 
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EX0N_1 83bp' 

1 ACGCGGGGGATCC7VGCTTGGGTAGGCGGGGAAGCAGCTGGAGTGCGACCGCTACGGCAGC 
6 1 CACCCTGCAACCGCCAGTCGGAG 



EX0N__2 61bp 

■ 1 AGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAAGAAGGCAAGGAGACATTGTCCCA 
61 G 



EX0N_3 525bp 

1 GATATTCrrGGTGATCTTGGAAGTGTCCGTATCATGGAATCAATCTCTATGATGGGAAGC 

61 CCTAAGAGCCTTAGTGAAACTTTTTTACCTAATGGCATAAATGGTATCAAAGATGCAAGG 

121 AAGGTCACTGTAGGTGTGATTGGAAGTGGAGATTTTGCCAAATCC^^ 

181 ATTAGATGCGGCTATCy^TGTGGTCATAGGAAGTAGAAATCCTAAGTTTGCTTCTGAAT^ 

241 TTTCCTCATGTGGTAGATGTCACTCATCATGAAGATGCTCTCACAAAAAC^^ 

301 TTTGTTGCTATACACAGAGAACATTATACCTCCCTGTGGGACCTGAGACATCTGCm^ 

361 GGTAAAATCCTGATTGATGTGAGCAATAACATGAGGATAAACCAGTACCCAGAATCCAA 

421 GCTGAATATTTGGCOTCATTATTCCCAGATTCTTTGATTGTC^^ 

481 TCAGCTTGGGCACTTCAGTTAGGACCTAAGGATGCCAGCCGGCAG 



EX0N_4 528bp 

1 GTTTATATATGCAGCAACAATATTCAAGCGCGACAACAGGTTATTGAACTTGCCCGCCAG 

61 TTGAATTTCATTCCCATTGACTTGGGATCCTTATCATCAGCC^ 

121 CCCCTACGACTCTTTACTCTCTGGAGAGGGCCAGTGGTGGTAGCTATAAGCTTGGCCACA 

181 TTTTTTTTCCTTTATTCCTTTGTCAGAGATGTGATTC^ 

24 1 AGTGACITTTACAAAATTCCrATAGAGATTGTGAATAAAACCra 

301 ACTTTGCTCTCCCTAGTATACCTCGCAGGTCTTCTGGOVGCTGCTTATCy^CTT^ 

361 GGCACCAAGTATAGGAGATTTCCACCTTGGTTGGAAACCTGGTTACAGTGTAGA 

421 CrrGGATTACTAAGTTTTTTCTTCGCTATGGTCC^ 

481 ATGAGAi^GTCAGAGAGATATTTGTTTCTCyU^CaVTGGCTTATC^ 

EX0N_5 165bp 

1 GTTGATGCAAATATTGAAAACTCTTGGAATGAGGAAGAAGTTTGGAGAATTGA^ 

61 ATCTCCTTTGGCATAATGAGCClTGGCTTACTTTCCCTCCro 

12 1 TOVGTGAGCAATGCTTTAAACTGGAGAGAATTGAGTTn 



EX0N_6 148bp 

1 TCTACACTTGGATATGTCGCTCTGCrCATJ^GTACTTTCCATGTTTTAATTTATC 

6 1 AAACGAGCTTTTGAGGAAGAGTACTACAGATTTTATACACCACCTUU^C 

121 CTTGTTTTGCCCTCAATTGTAATTCTGG 

cont_6+UTR 

1 gtaagattattttattccttccatgtataagccg:aaagctaaaacgaattaaaaaag^ 

6 1 gggaaaagagccaatttctggaagaaggtattggaggaacaattcctcatgtctccccgg 

121 agagggtcacagtaatgtgatgataaatggtgttcacagctgccatataaagttcta 

181 atgccattatttttatgacttctacgttcagttacaagtatgctgtcaaat^ 

241 ttgaaacttgttaaatgagatttcaactgacttagtgatagagttttcot 

301 ttcact^aatgtcatgtttgcgaatatgaattttt^ 

361 gtatgttttgttttgttttgcacaactgtaaccctgt^ 

421 gacaaaaatacttaca^gttaataatatagatataatgttaaaaacaal^ 

481 AGAATTTTAAGCTTTTAAAATAATTCAATGGATATACATTTTTTTCTGA^ 

541 TTAATTATTCAACTTAAAAAGTAGAAATGCATTATTATACaVTT^ 

601 GTTATGTTAGCATCTAGGTAAGGCTGCATGATAGCATTCCTATATTTCTCTCyVTAA^ 

661 GGATTTGAAGGATGAAATTAATTGTATGAAGCAATGTGATTATATGAAGAGAC^ 

721 AAAAAGACyVAATTAAACCTGAAATTATATTTAAAATATATTTGAQAC^T^ 

781 TGATAATACATACCTCATGAAAGATTTTATTCTTTATTGTGTTAC^ 

841 TCATATTAATATACTGATCAGGAAGAGGATTCAGTAACyiTTTGGC^ 

901 CTCTT^TACGGTACCAATCCTAGGAACTGTATACTAGTTCCTACTTAGAACAAAAGTATC 



FIGURE 4C 
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961 AAGTTTGCACACAAGTAATCTGCCAGCTGACCTTTGTCGCACCTTAACCAGTC^ 

1021 GCTATGGTATAGGATTATACTGATGTTCTTTGAGGGATTCTGATGTGCTAGGCATGGT^ 

1081 TAAGTACTTTACTTGTATTATCCCATTTAATACTTAGAACAACCCCGTGAGATAAGTAGT 

1141 TATTATCCTCATTTTACACATGAGGGACCGAAGGATAGAAAAGTTATTTT^ 

1201 TGOVGTTAATAAATGGCAGAGTGAGCATTCAAGTCCAGGTAGTCATATTCGAGAG^ 

1261 GGTTTTAACCACTAGGCTCTAGAGCTCCCGCCGCGCCCCTATGCATTATGTTCACAATGC 

1321 CAATCTAGATGCTTCCTCITTTGTATAAAGTCACTGACATTCTTTAGAGTGGGT^ 

1381 CATCCATU^AATGTATAAAAATATTATTATAATAAACTTATTACTGCnTGTAGGGTAA^ 

1441 ACAGTTACTTACCCTATTCTTGCTTGGAACATGAGCCTGGAGACCCATGGCAGTCCATAT 

1501 GCCrCCCTATGCAGTGAAGGGCCCTAGCAGTGTTAACTVAATTGCTGAGATCCC^ 

1561 CTTTC7VAAAATCTCTGTAGAGTTAGTCTTCTCCTTTTCTCTTCCT 

1621 CTGCATAACCATTCATTAGGGAGTACTTTACAAGCATGAAGGATATTAGGGTAAGT^ 

1681 AATTATAAATCTACTCTAGAGACATATAATCATACAGATTATTCATAAAAIUUUTCAGTG 

1741 CTGTCCTTCCACATTTAATTGCATTTTGCTCAAACTGTAGAATGCCCTAC^ 

1801 CCCCAATTTGCTATTTCCTTATTAAAATAGAAAATTATAGGCAAG^ 

1861 TTCCTCTTCCTGAAATTATAACATTTCTAAACTTACCa^CGTAGGGACT 

1921 CTGCCT^CAATAAAAAGACTTTTATTTAGTAGAGGCTACCT^ 

1981 TTCTACAACTGCCTTGTCyVGTTTGGTAATTCACraATGATTTTCTAATGTTCTCT^ 

2041 AATTTTATTATCTTCGACCCTCrTTTTTTTTT^^ 

2101 CCCATTGCTCTCGTTTGGGCAACa^GAGTGAAACTCTTGTCTC^^ 

2161 AGGTTTAAGACy^GTTTTGTCATTACTGGTGGGATCTGGTCACACAA 

2221 TGACATGGCACATAAAATTGGTTAAAAAATTTTGTTTTrT^ 

2281 CAACy^CACTTTATGCAAGATTGGAATGTATCTTCAAATTC^ 

2341 AAGATCCTCTGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 4C 



wo 01/72962 



PCTAJSOl/09410 



7/40 

1 ACGCGGGGGATCCAGCTTGGGTAGGCGGGGAACXrAGCTGGAGTGCGACCGCTACGGCAGC 

61 CACCCTGCAACCGCCAGTCGGAGAGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAA 

12 1 GAAGGCAAGGAGACATTGTCCCAGGATATTCTTGGTGATCrrGGAAGTGTCCGTATC^ 

181 GAATCAATCTCTATGATGGGAAGCCCTAAGAGCCTTAGTGAAACTTGTTTACCTAAT^^ 

241 ATAAATGGTATCAAAGATGCAAGGAAGGTCACTGTAGGTGTGATTGGAAGTGGAGAT^^ 

301 GCCAAATCCTTGACCATTCGACTTATTAGATGCGGCTATGATGTGGTCATAGGAAG 

361 AATCCTAAGTTTGCTTCTGAATTTTTTCCTCATGTGGTAGATGTCACTC^ 

421 GCTCTCACAAAAACAAATATAATATTTGTTGCTATACACAGAGAACATTATACCTCCCTC 

481 tgggacctgagacatctgcttgtgggtaaaatcctgattgatgtgagcaat;^ 

541 ataaaccagtacccagaatccaatgctgaatatttggcttcattattcccagato 

601 attgtcaaaggatttaatgttgtctcagcttgggcacotcagttaggacct;^ 

661 AGCCGGCAGGTTTATATATGCAGCAACAATATTCAAGCGCGACAA(^ 

721 GCCCGCCTVGTTGAATTTCATTCCCATTGACTTGGGATCCra 

781 GAAAATTTACCCCTACGACTCTTTACTTTCTGGAGAGGGCCAGTGGTGGTAGCTATAAGC 

841 TTGGCCACATTTTTTTTCCriTTATTCCTTTGTCA^ 

901 AACCAACAGAGTGACTTTTACAT^TTCCTATAGAGATTGTGAATAAAACCTTACCTA 

961 GTTGCCATTACTTTGCTCTCCCTAGTATACCTTGCAGGTCTTCTGGCAGCTGCTTATCT^ 

1021 CTTTATTACGGCACCAAGTATAGGAGATTTCCyVCCTTGGTTGGAAACCTGGTT^ 

1081 AGAAAACAGCTTGGATTACTAAGTTTTTTCTTCGCTATGGTCCATGTTGCCTAC^ 

1 14 1 TGCTTACOBATGAGAAGGTCAGAGAGATATTTGTTTCTCAACATGGCTTATCAG 

12 0 1 CATGCAAATATTGAAAACTCITGGAATGAGGAAGAAGTTTGGAGAAOT 

1261 TCCTTTGGCATAATGAGCCTTGGCTTACTTTCCCTCCrrGGCAGTCAC 

1321 GTGAGCAATGCCTTAAACTGGAGAGAATTCAGTTTTATTCAGTCTAC^ 

1381 GCTCTGCTCATAAGTACnTrcCATGTTTTAAITrATGGATGGAAACGA 

1441 GAGTACTAO^GATTTTATACACCACCAAACTTTGTTCTTGCTCTTGTT^ 

1501 GTAATTCTGGGTAAGATTATTTTATTCCOTCCATGTATAAGCCGAT^GCTAA^ 

1561 AAAAAAGGCraGGAAAAGAGCCAATTTCTGGAAGAAGGTATTGGAGGAACAAT^ 

1621 GTCTCCCCGGAGAGGGTCAO^GTAATGTGATGATAAATGGTGTTCACAGCTGCCy^TATi^ 

1681 AGTTCTACTCATGCa^TTATTTTTATGACTTCTAC^ 

174 1 TTATCGTGGGTTGAAACTTGTTAAATGAGATTTCAACTGACTTAGT^ 

1801 CAAGTTAATTTTCACAAATGTCTVTGTTTGCCAATATGAATTT^ 

-1861 TGTAATTTAGGTATGTTTTGTTTTGTTTTGCACAACTGTAACCCTGTTC 

1921 TTCATAATCAGACAAAAATACTTACAGTTAATAATATAGATATAATGTTAA 

1981 GCAAACCAGCAGAATTTTAAGCTTTTAAAATAATTCAATGGATAT^ 

2041 GATTAAGATTTTAATTATTCT^CTTAAAAAGTAGAAATGCATTATTATA 

2101 GAAAGGACACGTTATGTTAGCATCTAGGTAAGGCTGCATGATAGCATTCCrATAT^ 

2161 TCATAAAATAGGATTTGAAGGAT6AAATTAATTGTATGAAGCAATGTGATTATATGAAGA 

2221 GACACAAATTAAAAAGACAAATTAAACCTGAAATTATATTTAAAATATA^ 

2281 AAATACATACTGATAATAO^TACCTCATGAAAGATTTTATTCTTTATTGTGTTACA^ 

2341 AGTTTCATTTTCATATTAATATACaXSATCAGGAAGAGGATTCAGTAACATn^ 

2401 AAACTGCTATCTCTAATACGGTACCAATCCTAGGAACTGTATACTAGTTCCTACTTAGAA 

2461 CAAAAGTATCAAGTTrGCACACT^GTAATCTGCCAGCTGACCTITO 

2521 GTCACCACTTGCTATGGTATAGGATTATACTGATGTTCITTGAGGG^ 

2581 GGCATGGTTCTAAGTACTTTACTTGTATTATCCCATTTAATACTTAGAACAA 

2641 GATAAGTAGTTATTATCCTCATTTTACaCATGAGGGACCGAAGGATAGAAAAGTTATT^ 

2701 TCAAAGGTCTTGCAGTTAATAAATGGCAGAGTGAGCATTCAAGTCCAGGTAGTCATATO^ 

2761 CAGAGGCCACGGTTTTAACCACrAGGCTCTAGAGCTCCCGCCGCGCC^^ 

2821 TTCACAATGCaVATCrAGATGCTTCCTCTTTTGTATAAAGTCACTGACATTCT^ 

2881 GGGTTGGGTGCATCCAAAAATGTATAAAAATATTATTATAATAAACTTATTAC^ 

2941 AGGGTAATTCACAGTTACTTACCCTATTCTTGCTTGGAACATGAGCCTGGAGACCC^^ 

3001 CAGTCCATATGCCTCCCTATGCAGTGAAGGGCCCTAGCAGTGTTAACAAATTGCTGAGAT 

3061 CCCACGGAGTCTTTO^AAAATCTCTGTAGAGTTAGTCTTCTCCTTTTCTC^ 

3121 GTTCTCCTGCCTGCATAACCATTCATTAGGGAGTACTTTACAAGCATGAAGGATATTAGG 

3181 GTAAGTGGCTAATTATAAATCTACTCTAGAGACATATAATCATACAGATTATTCATAAAA 

3241 TTTTTCZAGTGCTGTCCTTCCAa\TTTAATTGCATT^ 

3301 ATTCCCCCCACCCCAATTTGCTATTTCCTTATTAAAAATGTATAATUATATTAI^^ 

3361 AAACTTATTACTGCTTGTAGGGTAATTCACAGTTACnn^ACCCTATTCOTGCT^ 
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3421 GAGCCTGGAGACCCATGGCAGTCCATATGCCTCCCTATGCAGTGAAGGGCCCTAGCAGTG 

3481 TTAACAAATTGCTGAGATCCCACGGAGTCTTTCAAAAATCTCTGTAGAGT^ 

3541 CTTTTCTCrTCCTGAGAAGTTCrCCItSCCTGaiTAACCATTC^ 

3601 AGCATGAAGGATATTAGGGTAAGTGGCTAATTATAAATCTACTCTAGAGACATATAATCA 

3661 TACAGATTATTCATAAAATTTTTCAGTGCTGTCCTTCCACATTTAATTGCAT^ 

3721 AACTGTAGAATGCCCTACATTCCCCCCACCCCAATTTGCTATTTCCTTATTAAAATAG^ 

3781 AATTATAGGCAAGATACAATTATATGCGTTCCrrCTTCCTGAAATTATAACATT^ 

3841 TTACCCACGTAGGGACTACTGAATCCT^CTGCCAACAATAAAAAGACTTTO 

3901 AGGCTACCTTTCCCCCCAGTGACTCITTTTCrACAACra 

3961 CTTATGATTTTCTAATGTTCTCITCGTGAATTTTATTATCT^ 

4021 TTTTTAAAGACAGAGTCTTGCTCTGTCACCCATTGCTCTCGTTTGGGCAA 

4081 ACTCTTGTCTCAAAAAAAAAAAAAAATGAGGlTrAAGACAGT^ 

4141 ATCKKSTCACACAAGATAGCATTAAACGTGACyiTGGC^ 

4201 TGTTTTTTAATTGCGTAATGTAAAAGCCCAACAAACACTTTATC 

4261 CTTO^TTCAGATTTAATAAACATGTAAAGATCCTCTGTAAAAAAAAAAAAAAAAAA^ 

4321 AAAAAAAAA 



FIGURE 4D 



wo 01/72962 



9/40 



PCT/USOl/09410 



1 ACGCGGGGGATCCAGCTTGGGTAGGCGGGGAAGCAGCTGGAGTGCGACCGCTACGGCAGC 

61 CACCCTGCAACCGCCAGTCGGAGAGCTAAGGGC^GTCCTGAGGTTGGGGCCy^GGtflGAT^ 

1 M 

12 1 GAAGGCAAGGAGACATTGTCCCAGGATATTCTTGGTGATCriTGGAAGTGTCCG 

2 ESISMMGSPKSLSETCLPNG 

181 GAATCAATCTCTATGATGGGAAGCCCTAAGAGCCTTAGTGAAACTTGTTTACCTAATGGC 

22 INGIKDARKVTVGVIGS6DF 

241 ATAAATGGTATCAAAOATGCAAGGAAGGTCACTGTAGGTGTGATTGGAAGTGGAGAT^ 

42 AKSLTIRLIRCGYHVVIGSR 

301 GCCAAATCCOTGACC?VTTCGACTTATTAGATGCGGCrAT^ 

62 NPKFASEFFPHVVDVTHHED 
361 AATCCTAAGTTTGCTTCTGAATTTTTTCCTCATGTGGTAGATGTC^ 

82 ALTKTNIIFVAIHREHYTSL 

421 GCTCTCACAAAAACAAATATAATATTTGTTGCTATACAC^ 

102 WDLRHLLVGKIliIDVSNNMR 

481 TGGGACCTGAGACATCTGCTTGTGGGTAAAATCCTGATTGATGTGAGCAATAACATC^ 

122 INQYPESNAEYLASLFPDSL 
541 ATAT^CCAGTACCCAGAATCCAATGCTGAATATTTGGCTTCATTATTC^ 

142 IVKGFNVVSAWAL.QLG PK DA 

601 ATTGTCAAAGGATTTAATGTTGTCTCAGCTTGGGCACTO 

162 SRQVYICSNNIQARQQVIEL 

661 AGCCGGCAGGTTTATATATGCAGCAACAATATTCAAGCXXrGAC^ 

182 ARQLNFIPIDLGSLSSAREX 
721 GCCCGCCAGTTGAATTTCATTCCCATTGACTTGGGATCCTTATCATCA 

202 ENLPLRLFTFWRGPVVVAIS 

781 GAAAATTTACCCCTACGACTCriTTACTTTCTGGAGAGGGCCAGTGGTGGTA^ 

222 LATFFFLYSFVRDVIHPYAR 

841 TTGGCCACATTTTTTTTCCTTTATTCCTTT 

242 NQQSDFYKIPIEIVNKTLPI 
901 AACCAACAGAGTGACTTTTACAAAATTCCTATAGAGATTGTGAATAAAACCTTACC^^ 

262 VAITLLSLVYLAGLLAAAYQ 

961 GTTGCCATTACTTTGCTCTCCCTAGTATACCTTGCAGGTCTTCTGGCAGCTGCTTATC^ 

282 LYYGTKYRRFPPWLETWIiQC 

1021 CTTTATTACGGCACCAAGTATAGGAGATTTCCACCTTGGTTGG^^ 

302 RKQLGLIjSFFFAMVHVAYSI; 
1081 AGAAAACAGCTTGGATTACTAAGTTTTTTCTTa3CTATOT 

322 CLPMRRSERYLFIiNMAYQQV 

1141 TGCTTACCGATGAGAAGGTCAGAGAGATATTTGTTTCTCAACATGGCTTATC^ 
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342 HANIENSWUEEEVWRIEMYI 

1201 CATGC^^TATTGAAAACrOTTGGAATGAGQAAGAAGTTTC 

362 SFGIMSLGLLSLIiAVTSIPS 

1261 TCCTTTGG»TAATGAGCCTTGGCrrACTTTCCCTCCTGGC^ 

382 VSNALNWREPSFIQSTLGYV 

1321 GTGAGCAATGCCTTAAACTGGAGAGAATTCAGTTTTATTC^GTCTACACTTGGATATGTC 

402 ALLISTFHVLIYGWKRAFEE 

1381 GCrrCTGCTCATAAGTACTTTCCATGTTTTAATTTATG 

422 EYYRFYTPPNFVLAIiVLPSI 

1441 GAGTACTAa^GATTTTATACACCACCAAACTTTCTTCTTC^ 

442 VILGKI ILPLPCISRKLKRI 

1501 GTAATTCTGGGTAAGATTATTTTATTCCTTCCATGTATAAGCCGAAAGCTAAAACG^ 

462 KKGWEKSQFLEEGIGGTIPH 

1561 AAAAAAGGCTGGGAAAAGAGCCS^TTTCTGGAAGAAGGTATTGGAGGi^ 

482 VSPERVTVM* 

1621 GTCrcCCCGGAGAGGGTCACAGTAATGTGATGATAAATGGTGTTCACAGC 
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EX0N_1 75bp 

1 GATCCTVGCTTGGGTAGGCGGGGAAGCAGCTGGAGTGCGACCGCCGCGGCAGCCACCCro 

6 1 AACCGCCAGTCGGAG 

EX0N_2 79bp 

1 AGAGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAAGAAGGCAAGGAGAC^ 

61 CAGGTAGGATGTGTCCCAG 

EX0N_3 525bp 

1 GATATTCTTGGTQATCTTGGAAGTGTCCGTATCATGGAATCAATCTCTATGATGGGAAGC 

61 CCTAAGAGCCTTAGTGAAACTTTTTTACCTAATGGCATAAATGGTATCAAAGATGCAAGG 

121 AAGGTCACTGTAGGTGTGATTGGAAGTGGAGATTTTGCCAAATCCTTGACCA^ 

181 ATTAGATGCGGCTATCATGTGGTCATAGGAA6TAGAAATCCTAAGTTTGCTTCTC 

241 TTTCCTCATGTGGTAGATGTCACTCATCATGAAGATGCTCTCACAAAAACAAATATAA 

301 TTTGTTGCTATACTVCAGAGAACATTATACCTCCCTGTGGGACCTGAGACTVTCTGCT^ 

361 GGTAAAATCCTGATTGATGTGAGCAATAACATGAGGATAAACCAGTACCCAGAATCCAAT 

421 GCTGAATATTTGGCTTCATTATTCCCAGATTCrrTTGATTGTCAAAGGATa^ 

481 TCAGCTTGGGCACTTCAGTTAGGACCTAAGGATGCCAGCCGGCAG 

EXGN_4 528bp 

1 GTTTATATATGCAGCAACAATATTGAAGCGCGAC7\ACAGGTTATTG^ 

6 1 TTGAATTTCATTCCCATTGACTTGGGATCCITATCATCAGCC^ 

121 CCCCTACGACTCTTTACTCTCTGGAGAGGGCCAGTGGTGGTAGCTATAAGCTTGGCCACA 

181 TTTTTTTTCCTTTATTCCTTTGTCAGAGATGTGATTCATC^ 

241 AGTGACITTTACAAAATTCCTATAGAGATTGTGAATAAAACCra 

301 ACTTTGCTCTCCCTAGTATACCTCGCAGGTCTTCTGGCAGCTGCTTAT(^^ 

361 GGCACCa\AGTATAGGAGATTTCCACCTTGGTTGGAAACCTGGTTAC^ 

421 CTTGGATTACTAAGTTTTTTCTTCGCTAT(^GTCCATGTTGCCTAa^GC 

481 ATGAGAAGGTCAGAGAGATATTTGTTTCrCAACATGGCTTATCa^GCAG 

EX0N__5 165bp 

1 GTTCATGCAAATATTGAAAACTCTTGGAATGAGGAAGAAGTTO 

6 1 ATCTCCTTTGGCATT^TGAGCCTTGGCITACTTTCCCTCCTGGC^ 

12 1 TCAGTGAGCy^TGCTTTAAACTGGAGAGAATTCAGTTTTATTCAG 

EXON__6 148bp 

1 TCTACACITGGATATGTCGCTCTGCTCy^TAAGTACTTTCCATGTTO 

6 1 AAACGAGCTTTTGAGGAAGAGTACTACAGATTTTATACACCACCAAAC^^ 

12 1 CTTGTTTTGCCCTCAATTGTAATTCTGG 

EX0N_7 +UTR 7 1 8bp 

1 AtCTTTTGCAGCTTTGCyVGATACCCAGACTGAGCTGGAACTGGAATTrGTCT 

61 ACTCTACTTCTTTAAAAGCGGCTGCCCATTACATTCCTaVGCTGTCCTTG^ 

121 TACATGTGACTOAGTGTTGGCCAGTGAGATGAAGTCTCCTCAAAGGAAGGCAG^ 

181 CCTTTTTCATCCCTTCTlTCTTGCTGCTGGGATTGTGGATATAACAGGAGCCCaX^ 

241 GTCTCCAGAGGATCAAAGCCACACCCAAAGAGTAAGGCAGATTAGAGACCAGAAAGACCr 

301 TGACTACTTCCCTACTTCCACTGCTTTTTCCTGCATTTAAGCCATTGTAAATCTGGGTGT 

361 GTTAGATGAAGTGAAAATTAATTCTTTCTGCCCrrC^ 

421 ACACTGTCTGAATTAACTAGACTGCAATAATTCTTTCTTTTGAAAGCTT^ 

481 TGTGCAATTCACATTAAAATTGATTTTCCATTGTCAATO 

541 TTGATCTTTCATTAGATATTTTGTATCTGCTTGGAATATATTATCTTCTTTTT^ 

601 TAATTGGTAATTACTAAAACTCTGTAATCTCCAAAATATTGCTATCAAATTACACAC^ 

661 GTTTTCTATCATTCTCATAGATCTGCCTTATAAACATTTAAATAAAAAGTAC 
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1 GATCCAGCTTGGGTAGGCGGGGAAGCAGCTGGAGTGCGACCGCCGCGGCAGCCACCC^^ 

61 AACCGCCAGTCGGAGAGAGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAAGAAGGC 

12 1 AAGGAGACT^TTGTCCCAGGTAGGATGTGTCCCAGGATATTCTTGGTGATCTTGGAAGTGT 

181 CCGTATCATGGAATCAATCTCTATGATGGGAAGCCCTAAGAGCCTTAGTGAAACTTT^ 

241 ACCTAATGGCATAAATGGTATCAAAGATGO^GGAAGGTCACTGTAGGTGTGATTG 

301 TGGAGATTTTGCCAAATCCTTGACCATTCGACTTATTAGATGCGGCTATCATGTGGTC^ 

361 AGGAAGTAGAAATCCTAAGTTTGCTTCTGAATTTTTTCCTCATGTGGTAGATGTCACTC^ 

421 TCATGAAGATGCTCTCACAAAAACAAATATAATATTTGTTGCTATAa\C7VGAGA^ 

481 TACCTCCCTGTGGGACCTGAGAGATCTGCTTGTGGGTAAAATCCTGATTGATGTGAGO^ 

541 TAACATGAGGATAAACCT^GTACCCAGAATCCAATGCTGAATATTTGGCa^ 

601 AGATTCrTTGATTGTCATVAGGATTTAATGTTGTCTCAGCTT^ 

661 TAAGGATGCCAGCOSGCAGGTTTATATATGCAGCAACAATATTCa^GCGCGACAAC^ 

721 TATTGAACTTGCCCGCCAGTTGAATTTCATTCCCy^TTGACTTGGGATCCTTATC^ 

781 CAGAGAGATTGAAAATTTACCCCTACGACTCTTTACTCTCTGGAGAGGGCCAGTGGTGGT 

841 AGCTATAAGCTTGGCCACATTTTTTTTCCTTTATTCCTTTGTCAGAGATGTGAT^ 

901 ATATGCTAGAAACCT^CAGAGTGACrTTTACAAAATTCCTAT^ 

961 CTTACCTATAGTTGCCATTACTTTGCTCTCCCTAGTATACCTCGCAGG 

1021 TGCTTATCAACirrATTACGGCyvCCAAGTATAGGAGATTTCC^ 

1081 GTTACAGTGTAGAAAACAGCTTGGATTACTAAGTTTTTTCOT 

1141 CTACAGCCrCTGCTTACCGATGAGAAGGTCAGAGAGATATTTGTTTCTC^ 

1201 TCAGCAGGTTCATGCAAATATTGAAAACTCTTGGAATGAGGAAGAAG 

1261 AATGTATATCTCCTTTGGCATAATGAGCCTTGGCTTACTTTCCCTCCTGGCAGTCACT^ 

1321 TATCCCTTCAGTGAGCAATGCTTTAAACTGGAGAGAATTCAGTT^ 

1381 TGGATATGTCGCTCTGCTCATAAGTACTTTCCATGTTTTAATTTATGGATGGAAACG^ 

1441 TTTTGAGGAAGAGTACTACAGATTTTATACACCyVCCAAA^^ 

1501 GCCCTCAATTGTAATTCTGGATCTTTTGCAGCnTTGCAGATACCC^ 

1561 TGGAATTTGTCTTCCTATimCTCfTACTTCTTTAAAAGCGGCTGCCCAT^ 

1621 GCTGTCCTTGCAGTTAGGTGTACATGTGACTGAGTGTTGGCCyiGTGAGA 

1681 CM^GGAAGGCyVGCATGTGTCCTTTTTCATCCCTTCA ' 

1741 TAACAGGAGCCCIXSGCAGCTGTCTCCAGAGGATCAAAGCCACAC^ 

1801 ATTAGAGACCAGAAAGACCTTGACTACTTCCCTACTTC^ 

1861 GCCATTGTAAATCTGGGTGTGTTACATGAAGTGAAAATTAATTCTTTC^^ 

1921 CTTTATCCTGATACCa^TTTAACACTGTCTGAATTAACTAGACTGC^ 

1981 TGAAAGCTTTTAAAGGATAATGTGCAATTCACATTAAAATTGA'r^^ 

2041 GTTATACTCy^TTrrCCTGCCETGATCTTTCyiTTAGATAOT 

2101 TTATCTTCTTTTTAACTGTGTAATTGGTAATTACTAAAACTCTGTAATCTCO^^ 
2161 GCTATCT^TTACyVCACCT^TGTITTCTATCATTCTCATAGATCT 

2221 AATAAAAAGTACTATTTA 
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1 GATCCAGCTTGGGTAGGCGGGGAAGCAGCTGGAGTGCGACCGCCGCGGCAGCCACCCTGCA 

62 ACCGCCAGTCGGAGAGAGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAAGAAGGCA 

122 AGGAGACATTGTCCCAGGTAGGATGTGTCCCAGGATATTCTTGGTGATCTTGGAAGTGTC 

MESISMMGSPKSLSETCL 

182 CGTATCATGGAATCAATCTCTATGATGGGAAGCCCTAAGAGCCTTAGTGAAACTTGTTTA 

81 P NGINGIKDARKVTVGVIGS 

242 CCTAATGGCATAAATGGTATCAAAGATGCAAGGAAGGTCACTGTAGGTO^ 

101 GDFAKSIiTIRIiIRCGYHVVI 

302 ggagattttgccaaatccttgaccattcgacttattagatgcggctatcatgtggt^^ 

121 gsrnpkfasefpp-hvvdvth 
362 . ggaagtagaaatcctaagtttgcttctgaatttittcctcatgtggtagatgtcact 

141 hedaltktniifvaihrehy 

422 catgaagatgctctcao^aaaacaaatataatatttgttgcratac^ 

161 tslwdlrhllvgkiiiidvsn 

4 82 acctccctgtoggacctgagacatctgcttgtgggtaaaatcctgato 

181 nmrinqypesnaeylasiipp 

542 aacatgaggataaaco^gtacccagaatccaatggtgaatattt^ 

201 dslivkgfnvvs :awalqlgp 

602 gattctttgattgtcaaaggatttaatgttgtct 

221 kdasrqvyicsn n iqarqqv 

6 62 aaggatgccagccggcaggtttatatatgcagct^caatattca;^ 

241 lelarqlnfipidlgslssa 

722 attgaacttgcccgccagttgaatttcatrcccattgacttgggatcctta 

261 reienlplrlftlwrgpvvv 

782 agagag attgaaaatttacccctacgactctttactctctggagagggccagtggtggta 

281 aislatffflysfvrdvihp 

842 gctataagcttggccacattttttttcctttattccrttgtc^ 

301 yarnqqsdfykipieivnkt 

902 tatgctagaaaccaacagagtgacttttacaaaattcctatagagattgtgaataaaacc 

321 lpivaitllslvylagllaa 

962 ttacctatagttgccattacittgcictccctag^^ 

341 ayqlyygtkyrr fppwletw 

1022 gcttatcyu^ctttattacggcaccaagtataggagatttccacct^ | 

361 lqcrkqlgllsfffamvhva 

1082 ttacagtgtagaaaacagcttggattacta^gttttttcttosctatc 

381 YSliCLPMRRSERYLFLNMAY 



FIGURE 4H 



wo 01/72962 



PCT/USOl/09410 



14/40 

1142 TACAGCCTCTGCTTACCGATGAGAAGGTCAGAGAGATATTTGTTTCTCAA^ 

401 QQVHANIENSWNEEEVWRIE 

1202 CAGCAGGTTCATGCAAATATTGAAAACTCTTGGAATGAGGAAGAAGTTTG^ 

421 MYIS FGIMSLGLLSLLAVTS 

1262 ATGTATATCTCCTTTGGCATAATGAGCCTTGGCTTACTTTCCCTCCTGG^ 

441 IPSVSNAIiNWREFSFIQSTL 

1322 ATCCCTTCAGTGAGCAATGCTTTAAACTGGAGAGAATTCAGITTO 

461 GYVALLISTFHVIiIYGWKRA 

1382 GGATATGTCGCTCTGCTCATAAGTACTTTCCATGTTTTAATTTATGGATC^ 

481 FEEEYYRFYTPPNFVLALVL 

1442 TTTGAGGAAGAGTACTACAGATTTTATACACCaVCCAAACTT^ 

501 PSIVILDLLQLCRYPD- 

1502 CCCTCAATTGTAATTCTGGATCTTTTGCAGCTTTGCAGATACCCAGACTO 
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EX0N_1 75bp 

1 GATCCAGCTTGGGTAGGCGGGGAAGCAGCTGGAGTGCGACCGCCGCGGCAGCCACCCTGC 

6 1 AACCGCCAGTCGGAG 

EX0N_2 79bp 

1 AGAGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAAGAAGGCAAGGAGACATTC 

6 1 CAGGTAGGATGTGTCCCAG 



EX0N_3 525bp 

1 GATATTCTTGGTGATCTTGGAAGTGTCCGTATCATGGAATCa^TCTCTATGATGGGAAG 

6 1 CCTAAGAGCCTTAGTGAAACTTTTTTACCTAATGGCATAAATGGTATCAAA^ 

121 AAGGTCACTGTAGGTGTGATreGAAGTGGAGATTTTGCCAAATCCTTGAC^^ 

181 ATTAGATGCGGCTATCATGTGGTCATAGGAAGTAGAAATCCTAAGTTTGCTTCTGAATTT 

241 TXTCCTCATGTGGTAGATGTCACTCATCATGAAGATGCTCTCA 

301 TTTGTTGCTATACACAGAGAACATTATACCTCCCTGTGGGACCTGAGACATC^ 

361 GGTAAAATCCTGATTGATGTGAGCAATAACATGAGGATAAACCAGTACCCAQAATCCAAT 

421 GCTGAATATTTGGCTTCATTATTCCCAGATTCTTTGATTGTCAAAGC^ 

481 TCAGCTTGGGCy^CTTCAGTTAGGACCTAAGGATGCCAGCCGGCAG 



EX0N_4 528bp 

1 GTTTATATATGCaGCAACaATATTCAAGCGCGACAACAGGra 

6 1 TTGAATTTCATTCCCATTGACTTGGGATCCTTATCATCAGCCAGAGAGATT^^ 

121 CCCCTACX3ACrCTTTACTCTCTGGAGAGGGCCAGTGGTGGTAGCTATAAGCTTGGCCAC^ 

181 TTTTTTTTCCTTTATTCCTTTCTCAGAGATGTGATT 

241 AGTGACTTTTACAAAATTCCTATAGAGATTGTGAATAAAACCTTA 

301 ACTTTGCTCTCCCTAGTATACCTCGCAGGTCTTCTGGCAGCTGCTTATCT^ 

361 GGCACCAAGTATAGGAGATTTCCACCTTGGTTGGAAACCTGGTTACAGTGTAGAAAACM 

421 CTTGGATTACTAAGTTTTTTCTTCGCTATGGTCCATGTT^ 

.481 ATGAGAAGGTCAGAGAGATATTTGTTTerCAACATiKC^ 

EX0N_5 165bp 

1 GTTCATGCAAATATTGAAAACTCTTGGAATGAGGAAGAAGTTTGGAG^ 

• 61 ATCrCCTTTGGCATAATGAGCCTTGGCTTACTTTCCCTCe^^ 

12 1 TCAGTGAGCAATGCTTTAAACTGGAGAGAATTCAGTTTTAl^ 



EX0N_7 and 3»trrR 718bp 

1 ATCTTTTGCAGCTTTGCAGATACCCAGACrGAGCTGGTUVCTGGAATTTGTCOT 

6 1 ACTCTACTTCTTTAAAAGCGGCTGCCCATTACATTCCTCAGCTGTCCTTGCAGTTAGGTC 

121 TACATGTOACTGAGTGTTGGCCAGTGAGATGAAGTCTCCTCAAAGGAAGGCA^ 

181 CCTTT'n'CATCCCTTCATCTTGCTGCTGGGATTGTGGATATAACAGGAGCCCTO 

241 GTCrCCAGAGGATCAAAGCCACACCCAAAGAGTAAGGCAGATTAGAGACCAGAAAGACCT 

301 tgactacttccctacitccactxk:tttttcctgcatttaagcc^ 

361 gttacatgaagtgaaaattaattctttctgcccttcagttcit^ 

421 ACACTGTCTGAATTAACTAGACTGCAATAATTCTTTCTTTTGAAAGCTTT^ 

481 TGTGCAATTCACATTAAT^TTGATTTTCCATTGTCAAT^ 

541 TTGATCTTTCAIT'AGATATTTTOTATCTGCTTGGAATATATTATCTTCT^ 

601 TAATTGGTAATTACTAAAACTCTGTAATCTCCAAAATATTGCTATCAAATTACACACCAT 

661 GTTTTCTATCATTCTCATAGATCTGCCTTATAAACa^TTTAAATAAAAAGTACTATTTA 
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1 GGATCCAGCTTGGGTAGGCGGGGAAGCAGCTGGAGTGCGACCGCTACGGCAGCCACCCTG 

6 1 aiACCGCCAGTCGGAGAGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAAGAAGGC^ 

121 AGGAGACATTGTCCCAGGATATTCTTGGTGATCTTGGAAGTGTCCGTATC^ 

181 TCTCTATGATGGGAAGCCCTAAGAGCCn^TAGTGAAACTTGTTTACCTAATGG^ 

241 GTATO^GATGCAAGGAAGGTCACTGTAGGTGTGATTGGAAGTGGAGATT^ 

301 CCTTGACCATTCGACTTATTAGATGCGGCTATCATGTGGTCATAGGAAGTAGAAATCCTA 

361 AGTTTGCTTdTGA AaTXTlT CCTCATGTGGTAGATGTCACTCATCATGAAGATC 

421 CAAAAACAAATATAATATTTGTTGCTATACACAGAGAACATTATACCTCCCTGTGGGACC 

481 TGAGACATCTGCTTGTGGGTAAAATCCTGATTGATGTGAGCAATAACT^TGAGG^ 

541 AGTACCCAGAATCCAATGCTGAATATTTGGCTTCATTATTCCCAGATTCrn^^ 

601 AAGGATTTAATGTTGTCTCAGCTTGGGCACTTCAGTTAGGACCTAAGGATGCC^ 

661 AGGTTTATATATGCAGCAACyVATATTCAAGCGCGACAACAGGITATTG^ 

721 AGTTGAATTTCATTCCCATTGACTTGGGATCCTTATCa^TCAGCCAGAGA 

781 TACCCCTACGACTCTTTACTTTCTGGAGAGGGCCAGTGGTCGTAGCTATAAGCTTGG^ 

841 CT^TTTTTTTTCCTTTATTCCTTTGTCAGAGATGTGATTC^ 

901 AGAGTGACTTTTACAAAATTCCTATAGAGATTGTGAATAAAACCTTACCTATAGTTGCCA 

961 TTACTTTGCTCTCCCTAGTATACCTTGCAGGTCTTCTGGCAGCTGCTTATC^ 

1021 ACGGCACCAAGTATAGGAGATTTCCACCTTGGTTGGAAACCTGGTTACAGTGTAGAAAA 

1081 AGCTTGGATTACTAAGTTTTTTCTTCGCTATGGTCCATGTTGCCTACy^GCCTCTGC^ 

1141 CGATGAGAAGGTCAGAGAGATATTTGTTTCTCAACATGGCTTATCAGCAGGTTC^ 

1201 ATATTGAAAACTCTTGGAATGAGGAAGAAGTTTGGAGAATTGAAATGTATATCTCCT 

1261 GCATAATGAGCCTTGGCTTACTTTCCCTCCTGGCyiGTCACTTCT 

1321 ATGCTTTAAACTGGAGAGAATTCAGTTTTATTCy^GATCrTTTGCAGCT^ 

1381 AQACraAGCTGGAACTGGAATTTGTCTTCCTATTGACTCrACTTC^ 

1441 CCT^TTACATTCCTCAGCTGTCCTTGCAGTTAGGTGTACATGTGACTGAGTGTTG^ 

1501 GAGATGAAGTCTCCTCAAAGGAAGGCAGCATGTGTCCTTITrCATC 

1561 CTGGGATTGTGGATATAACAGGAGCCCTGGCAGCTGCTCCAGAGGATCAAAGCCAC:^^ 

1621 AAAGAGTAAGGO^GATTAGAGACCAGAAAGACCTTGACTACTTCCCTAC^ 

1681 TTTCCTGCATTTAAGCCATTGTAAATCTGGGTGTGTTACATGAAGTGAAAATTAAOT 

1741 TCTGCCCTTCAGTTCTTTATCCTGATACCATTTAACACTGTCTG^ 

1801 ATAATTCTTTCTTTTGAAAGCTTTTAAAGGATAATG^ 

1861 TCCyVTTGTCAATTAGTTATACTCATTTTCCTGCCrrTG^ 

1921 CTGCTTCGAATATATTATCTTCTTTTTT^CTGTGTAATT^ 

1981 ATCTCCT^AAATATTGCTATCAAATTACACACCATGTTTTCTATC^ 

2041 CTTATAAACT^TTTAAATAAAAAGTACTATTTACCAAAAAAAAAAAAAAAAAAAAAAAA^ 

2101 AA 



FIGURE 4 J 



wo 01/72962 



17/40 



PCT/USOl/09410 



IGGATCCAGCTTGGGTAGGOSGGGAAGCAGCTGGAGTGCGACCGCTACGGCAGCC^ 

63 ACCGCCAGTCGGAGAGCTAAGGGCAAGTCCTGAGGTTGGGCCCAGGAGAAAGAAGGCA^ 

1 M E S I 

123 GAGACATTGTCCCAGGATATTCTTGGTGATCITGGAAGTGTCCGTATCATGGAAT 

5 SMMGSPKSLSETCLPNGING 

183 TCTATGATGGGAAGCCCTAAGAGCCTTAGTGAAACTTGTTTACCTAATGGCATAAATGGT 

25 IKDARKVTVGVIGSGDFAKS 

243 ATCAAAGATGCAAGGAAGGTCACTGTAGGTGTGATTGGAAGTGGAGATT^^ 

45 LTIRLIRCGYHVVIGSRNPK 

303 TTGACCATTCGACTTATTAGATGCGGCTATCATGTGGTCATAGGAAGTAGAAATCCT 

65 FASEFFPHVVDVTHHEDALT 

363 TTTGCTTCTGAATTTTTTCCTC^TGTGGTAGATGTCACTCATC^ 

85 KTNIIFVAIHREHYTSLWDL 

423 AAAACAAATATAATATTTGTTGCTATACa^CAGAGAACTlTTATACCTCC 

105 RHLIiVGKILIDVSNNMRINQ 

483 AGACATCKXrrrGTGGGTAAAATCCTGATTGATGTGAGC^ 

125 YPESNAEYLASLFPDS LIVK 

543 TACCCAGAATCCAATGCTGAATATTTGGCrTCATTATTCCCAGATTCT^ 

145 GFNVVSAWALQLGPKDASRQ- 

603 GGATTTAATGTTGTCTCAGCriTCGGCACTTCAGTTAGGACCT;^ 

165 VY I C SNN I QARQ QV l E^L ARQ 

663 GTTTATATATGCAGCAACAATATTCAAGaSCGACAACaVGGTTAT^^ 

185 LNFIPIDliGSLSSAREIENL 

723 TTGAATTTCATTCCCATTGACTTGGGATCCTTATCATCAGCCAGAGAGAT^^ 

205 PLRLFTFWRGPVVVAISLAT 

783 CCCCTACGACTCTTTACTTTCTGGAGAGGGCCAGTGGTGGTAGCrrATAAGCTTGGCCACA 

225 FFFLYSFVRDVIHPYARNQQ 

843 TTTTTTTTCCTTTATTCCITTGT(^^ 

245 SDFYKIPIEIVNKTLPIVAI 

903 AGTGACTTTTACAAAATTCCTATAGAGATTGTGAATAAAACCTTACCTATAGTTC 

265 TLLSLVYLAGLLAAAYQLYY 

963 ACTTTGCTCTCCCTAGTATACCTTGCAGGTCTTCTGGCAGCTGCTTATC^ 

285 GTKYRRFPPWLETWLQCRKQ 

L 0 2 3 GGCACCAAGTATAGGAGATTTCCavCCrTGGTTGGAAACCTGGT^ 

305 LGLLSFFFAMVHVAYSLCLP 

LO 8 3 CTTGGATTACTAAGTTTTTTCTTOSCTATGGTCCA 

325 MRRSERYLFLNMAYQQVHAN 
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1143 ATGAGAAGGTCAGAGAGATATTTGTTTCTCAACATGGCTTATCAGCAGGTTCA 

345 lENSWNEBEVWRIKMYISFG 

1203 ATTGAAAACTCTTGGAATGAGGAAGAAGTTTGGAGAATTGAAATGTATATCrCC^^ 

365 IMSLGLLSLLAVTS IPSVSN 

1263 ATAATGAGCCTTG6CTTACTTTCCCTCCTGGCAGTCACTTCTATCCCTTCGGTGAGCAAT 

385 ALNWREFSFIQIFCSFADTQ 

1323 GCTTTAAACTGGAGAGAATTGAGTTTTATTOIGATCTTTTGCAGCTT^ 

405 TELELEFVFLLTLLL* 

1383 ACTGAGCTGGAACTGGAATTTGTCTTCCTATTGACTCTACTTCTTTAAAAGCGGCTGCC^ 

1443 ATTACATTCCTCAGCTGTCCrTGCAGTTAGGTGTACATGTGACTGA^ 

1503 GATGAAGTCTCCTCAAAGGT^GGCAGCATGTGTCCTTTTTC^^ 

1563 GGGATTGTGGATATAACAGGAGCCCTGGCAGCTGCTCCAGAGGATCAAAGCCACACCCAA 

1623 AGAGTAAGGCAGATTAGAGACCAGAAAGACCTTGACTACITCCCTACTTCaVC^ 

1683 TCCTGCATTTAAGCCATTGTAAATCn^GGtGTGTTACATGAAGTGAAAATO 

1743 T6CCCTTCAGTTCTTTATCCTGATACCATTTAACACTGTCTGAATTAACTAGACTC 

1803 AATTCTTTCTTTTGAAAGCTTTTAAAGGATAATGTGC^ 

1863 CATTGTCAATTAGTTATACTCATTTTCCTGCCTTGATCTTTCATTAa^ 

1923 GCTTGGAATATATTATCTTCTTTTTAACTGTGTAATTGGTA^ 

1983 CTCCAAAATATTGCTATCAAATTACACACCATGTTTO 

2043 TATAAACATTTAAATAAAAAGTACTATTTACCAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 MEKTCIDALPLTMNSSEK 
1 ACAGATCTATGG^GAAAACOTGTATAGATGCACTTCCTCTTACTATGAATTC^ 

19 QETVCIFGTGDFGRSLGLKM 

6 3 CAAGAGACTGTATGTATTTTTGGAACTGGTGATTTTGGAAGATCACTGGGATTGAAAATG 

39 LQCGYSVVFGSRNPQKTTLL 

12 3 CTCCAGTGTGGTTATTCTGTTGTTTTTGGAAGTCGAAACCCCCAGAAGACCACCCTACTG 

59 PSGAEVIiSY'SEAAKKSDIII 

183 CCCAGTGGTGCAGAAGTCTTGAGCTATTCAGAAGCAGCCAAGAAGTCnX3ACATCATJ^ 

79 lAIHREHYDFLTELTEVLNG 

243 ATAGCa^TCCACAGAGAGCATTATGATTOTCTCACAGAATTAACT 

99 KILVDISNNLKINQYPESNA 

303 AAAATATTGGTAGACATCAGCAACAACCTCAAAATCAATCAATATCCA^ • 

119 E Y I, A/T HLVPGAHVVKAFNTIS 

363 GAGTACCTTGCTCATTTGGTGCCAGGAGCCCACGTGGTAAAAGCATTTAAC^ 

139 AWALQSGALDASRQVFVCGN 

423 GCCTGGGCTCTCCa.GTaiGGAGCACTOGATGCAAGTCGGCAGGTGTTTGT^ 

159 DSKAKQRVMDIVRNLGLTPM 

483 GACAGCAAAGCCAAGOVAAGAGTGATGGATATTGTTCGTAATCTTGGACOT 

179 DQGSLMAAK EIEKYPIiQLFP 

543 GATCAAGGATCACTOVTGGCAGCCAAAGAAATTGAAAAGTACCCCCTGCAGCTATTTC 

199 MWRFPPYLS AVLCVFLFFYC 

603 ATGTGGAGGTTCCCCnrCrrATTTGTCTGCTGTGCTGTGTG^ 

219 VIRDVIYPYVYEKKDNTFRM 

663 GTTATAAGAGACGTAATCTACCCTTATGTTTATGAAAAGAAAGATAATACATTTCGTATG 

239 AISIPNRIFPITALTLLALV 

723 GCTATTTCCATTCCAAATCGTATCTTTCCAATAAOJlGCTVCTTACACTGCTTGCTTO 

259 YLPGVI AAILQLYRGTKYRR 

783 TACCTCCCTGGTGITATTGCTOCCATTCTACAACTGTACCGAGGCACAAAATACCGTCGA 

279 FPDWLDHWMLCRKQLGIiVAL 

843 TTCCCAGACTGGCTTGACCACTGGATGCITTGCCGZ^GCAGCTT^ 

299 GFAPLHVLYTLVIPIRYYVR 

903 GGATTTGCCTTCCTTCATGTCCTCTACACACTTGTGATTCCTATTCX3ATATTATGTACG^ 

319 WRIiGNLTVTQAI P/L K K E N P F S 

9 63 TGGAGATTGGGAAACTTAACCGTTACCCAGGCAATACCCAAGAAGGAGA^ 

339 TSSAWLSDSYVALGILGFFL 

1023 ACCTCCTCT^GCCTGGCTCAGTGATTCTVTATGTGGCTTTGGGAATAC^^ 

359 FVIjLGITSLPSVSNAVNWRE 



FIGURE 4L 



wo 01/72962 



PCTAJSOl/09410 



20/40 

1083 TTTGTACTCTTGGGAATCACTTCTTTGCCATCTGTTAGCAAT^^ 

379 FRFVQSKLGYLTLILCTAHT 

114 3 TTCCGATTTGTCCAGTCCT^CTGGGTTATTTGACCCTGATCTTGTG 

399 LVYGGKRFLSPSNLRWYLPA 

1203 CTGGTGTACGGTGGGAAGAGATTCCTCy^GCCCTTCTU^TCTCAGATGGTATC 

429 AYVLGLIIPCTVIiVIKFVLI 

1263 GCCTACGTGTTAGGGCTTATCATTCCTTGCACTGTGCraGTGATCAA 

439 MPCVDNTLTRIRQGWERNSK 

1323 ATGCCATGTGTAGACAACACCCTTACAAGGATCCGCCy^GGGCTGGGAAAGGAACTCAA^ 

459 H - 

1383 CACTAGCTCGAGGT 
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lACCCTTCGCCGCGC3ACCTTCa^TGCCGCC3GTCGCrrcCGAGC^ 

63 AAGCGATTCTCXrrGCTTCAQCCTCCGQAGTAGOTGGC3ATTACAGGCAOT 

1 MPEEMDKPLISLHLVD 

L2 3 CCaGCCACCAAAATGCaVGAAGAGATGGACAAGCCACTGATCAGCX:TCC^ 

17 SDSSLAKVPDEAPKVGILGS 

183 AGCGATAGTAGCCTTGCCAAGGTCCCCGATGAGGCCCCCAAAGTQGGCATCCTGGGTAGC 

37 GDFARSLATRIiVGSGFKVVV 

24 3 GGGGACirrGCCCGCTCCCTGGCCACACXSCCTGGTGGGCTCri^CrrTC^^ 

57 GSRNPKRTARLYPSAAQVTP 

303 GGGAGCCGO^ACCCCAAACGCACAGCCAGGCTQTATCCCTCAGCGGCCCAAGTGACTTTC 

77 QBEAVSSPEVIFVAVPREHy 

363 CAAGAGGAGGCAQTGAQCTCCCCGGAGGTCATCrTTGTaGCTGTGTTCCGGGAQCACT 

S7 SSLCSLSDQLAGKILVDVSW 

423 TCTTCACTGTGCAGTCTCAGTGACCAGCTOGCGGGCAAGATCCT^ 

117 PTEQEHLQHRESNAEYLASL 

483 CCTACAQAGCAAGAGCACCTTCAGCATCGTGAGTCaUVTOCTGAG^ 

137 FPTCTVVKAFNVISAWTIjO A 

543 TKX:CX::ACTTCCACAGTGGTCAAGGCCTrC:AATC 

157 GPRDGNRQVPrCGDQPEAKR 

■603 GGCCCAAGGGATGGIT^OIGGCAGGTGCCCATCTQCGGTOACCAGCC^^ 

177 AVSEMALAMGFMPVDMGSIiA 

663 GCTOTCTCGGAGATGGCGCTCGCCa.TGGGCTTCATGCCCGT^^ 

197 SAWEVEAMPLRLLPAWKVPT 

723 TCAQCCTGGGAGGTGGAGGCCATGCCCCTGCGCCTCCTCCCGGCC^^ 

217 LLALGLPVCFyAYNFVRDVIj 

783 CTBCKraCCCTOGGGCTCTTOSTCTGCTTCTATGCC^ 

237 QPyVQESQNKPFKLPVSVVM 

843 CAGCCCTATGTGCy^QAAAGCCAQAACAAQTTCTTCAAGCTGCCX^ 

257 T TLPCVAYVLLSLVYLPGVL 

903 ACCACACTGCCGTGCGTGGCCTACGTGCTGCTGTCACTCGTGTACTIXKICCaGCGTGCT 

277 AAAL QLRRGTKYQRFPDWLD 

9 63 GOGGCTGCCCTGCAGCTGCGGCGCGGCACCAAGTACCAGCGCTTCCCCGACTGGC^ 

297 HWLQHRKQIGLXjSFFCAALH 

023 CACTOGCTACAGmCCGCAAGCAGATCGGGCrGCIXyiGCTO 

317 ALYSFCLPLRRAHRYDLVNL 

.083 6CCCTCTACAGCTTCTGCrnX3CCGCTGCQCCGCGCCC3^CCTCTACGACCT^ 

337 AVKQVLAKKSHLWVEEEVWR 

.143 GCAGTOU^GCAGGTCITGGCCAACAAGAGCCACCTCTGGGTGGAGGAGGAGGTCTGG^ 

357 MEIYLSLGVLALGTLSLLAV 

.203 ATGGAGATCTACCrrCTCCCTOGGAGTGCTGGCCCTCGGCACGTTGTCCCTGCTXSGCCGTO 
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377 TSLPSIANSIiNWREFSFVQS- 

12 63 ACCTCACTGCCGTCCATTGCAAACTCGCTCAACTGGAGGGAGTTCTVGCTTC 

397 SLGFVALVLSTLHTLTYGWT 

1323 TCACTGGGCTTTGTGGCCCTCGTGCTGAGCT^CACTGCACyvCGCTCACCTACXS 

417 RAFEESRYKF YLPPTFTLTL 

1383 CGCGCCTTCGAGGAGAGCCGCTACAAGTTCTACCTGCCTCCOVCCTTCJICGCTCATC 

437 LVPCVVIIiAKALFLLPCISR 

14 4 3 CTGGTGCCCTGCGTCGTCATCCTGGCCAAAGCCCTGTTTCTCCTGCCCTGCATCAGCCGC 

457 RLARIRRGWERESTIKFTLP 

1503 AGACTCGCCAGGATCCGGAGAGGCTGGGAGAGGGAGAGCACCATCAAGTTCACGCTGCCC 

477 TDHALAEKTSHV- 

1563 ACAGACCACGCCCTGGCCGAGAAGACGAGCCAaSTATGAGGTGCCTGCCCTGGGCT^ 

1623 ACCCCGGGCACACGAGGGACGGTGCCCroAGCCCGTTAGGTTTTCTTT^^ 

1683 AAAGTGGTATAACTGTGTGCAAATAGGAGGTTTGAGGTCCAAATTCCTGGGACT 

1743 TATGCAGTACTATTCAGAATGATATACAO^CATATGTGTATATGTATTTACAT^ 

1803 . ACATATATAACAGGATTTGCAATTATACATAGCTAGCrA;^^ 

1863 TCAACTTGTAGATTTAAAAACAAGTGCCGTACGTTAAGAGAAGAGCAGATC^ 

1923 TGACATTTGCAGAGATATACACACACTTTTTGTACAGAAGAGGC^^ 

1983 TCGATTTATCCCTGCCCACCCCATCCCCACAACTTCCCTT^^ 

2043 TTGCAGAGCTAGGGCTCTGAAGGGGAGGGAAGGCAACGGCTCTGCCCAGAGCCyV^^ 

2103 GAGCATGTGAGCAGOSGCTGGTCTCTTCCCTCCACCTGGGGCAGCyVC^ 

2163 GGGGAGGAAAATCAGGCAGTCGGCCTGGAGTCTGTGCCTGGTCCTTO 

2223 AGGATGGAGGGATTGGGCTCAAGCTGCTCCACCTCyiTCCTT^ 

2283 TTCCCTGAAAGTCAGAAGTCACCATAGAGCCTGCA^ 

2343 TCACCTCCTTTCCAGAGCCATTAGTGAGCCTGGCTTGG 

2403 TCCTTTAACCTGGCGATGAGCGTCCTTTAAACCACTGTGCCCT 

2463 CAGTTTGAACckcTCCCAGQAAGGCCTAGAGCAGACCCIT 

2523 AGAGCAAGAGAAAACACTCTAGGGAGTAAAGCTCCaZIGGGCGTCAGAGT^^ 

2583 TGGGCTGAAGGACTGTCTTCAOWIGTCAGTCCIX^ 

2643 GTCCTCTGGCAGAGGACCCAGAAAACCACyVCTGGCTCCAACTTCCTCCTCA 

2703 ACACTTCT^AAACAGTGGGGAGCAACITTTCCACCAAAGCTAa^ 

2763 CCCCAAAGCACAAGAGGGAAGAGCACCGCCGGGGCCACAGGACGTCTGTCCTCCAGTCAC 

2823 AGGCCATCCTTGCTGCTCCCTACTGACTCTAGCTTACTTCCCCTGTGAAGAAACAGGT^ 

2883 TCTOTGCTGAGCCCCCAACCCTCTGCAGAACCy^GGTTGATCTGCCACAGA;^^ 

2943 TTGAAGACA?VAGAGGGTGAGGTCTTC:7^TGAGTCrCCTGGGCCCAAAGC^ 

3003 GAAGGAAGAGAGTAGGGCCAGTGAAGGCTGCCCAGAGAGAATGTCACAGATGAGGCTGCC 

3063 CCTGCCCCCTCCCCGCCAGGGAGGTXTCATGAGCTCATGTCTATGCAGCACATAAGGGTT 

3123 CTTCAGTGAAAAGCAGGAGAAGAGCCOICTGCAAGGATAGCTCATTAGGCACATGACCGA 

3183 TGCAGGGAAGGCCATGCCGGGGAAGCTCTTCCTGCAGGTATTIT 

3243 GGCTGAGCGGCAGAAACTTGTCTCATAAATTGGCACTGATGGAGCATCAGCTGTG^ 

3303 CAGAGAGCCTTGCTGAGAAGGGGGCAGGTAAAGOiGAGATTTTAGCATTGCCa^ 

3363 ACy^AGGGCCCATCGATTCCCTACTAATGAGAGGCAGGGAGAGCATGGGCAATO 

3423 ACCAATGATCCCCAACCCCGGTGGGTACTGGCTGCCnXSCCCTGGGCCAGGG;^ 

3483 TTATACCAAAGATGCTGGCACATAGCAGAACCCAGTGCACGTCCTCCCCTTCC^^ 

3543 CTCTGGCTGAAGGTGCrCAAGAGGGAAGCAATTATAAGGTC^ 

3603 TGCCACCTGCTOGACAATCACACGAAAGGCT^GGCGGGCTGTGTACTGGGCCCTQACTGTO 

3663 CGTCCACTGCTGTCITCCCTACCTCTVCCAGGCTACTGGCAGCAGCATCCCGAGAGC^ 

3723 Cy^TCTCCACAGCCTGGTAAATTCCATGTGCCTCTGGGTACaA^ 

37 83 CTCTCGAAATCCCAAATGCa^CAGTCTGAGGTTGATATCTJ^^ 

3843 AGTCTCTCTTTTTTTTTTTTAACCTGGTAGACGGTATAAA^ 

3903 AACCTTCTGC 
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1 @GCGGCGGCTCCTGCAGCGGTGGTCX3GCrGTTGG6TGTGGAGTTTCCCAGCGCCCCTCGGG 
1 ■ M T 

62 TCCGACCCTTTGAGCGTTCTGCTCaSGCGCCAGCCTACCTCGCTCCTCG GCGCCATGA CC 



3 TTTTFKGVDPNSRNSSRVLR 

12 2 ACAACCACCACCTTCAAGGGAGTCGACCCCAACAGCAGGAATAGCTCCaSAGTTOT 

23 PPGGGSNFSLGFDEPTEQPV 

182 CCTCCAGGTGGTGGATCCAATTTTTCATTAGGTTTTGATGAA 

43 RKNKMASNI FG TPEENQASW 

242 AGGAAGAACmAATGGCCTCTAATATCrrTGGGACACCrGAAGAAAATCAAGCTTCT^ 

63 AKSAGAKSSGGREDLESSG'L 

302 GCCAAGTCy^GCAGGTGCCAAGTCTAGTGGTGGCAGGGAAGACTTGGAGTCATCrGG 



83 QRRNSSEASSGDFLDLKGEG 

362 CAGAGAAGGAACTCCTCTGAAGCAAGCrCCGGAGACTTCrTAGATCTr^ 

103- DIHENVDTDLPGSLGQSEEK 

^ , 422: GATATTCATGAAAATGTGGACACAGACTTGCCAGGCAGCCTGGGGCAGAGTGAA 

123 PVPAAPVPSPVAPAPVPSRR 

482 CCCGTGCCTGCTGCGCCTGTGCCCAGCCCGGTGGCCCCX3GCCCCAGTGCCATCCAGAAGA 

143NPPGGKSSIiVLG* 

542 AATCCCCCTGGCGGCAAGTCCAGCCTCGTCTTGGGTTAGCTCTGACro 

602 TCGTTCTGTCTGTTTCCTCCATGCTTGTGAACTGCACAACTT^ 

662 CTTGGATTTGTTTC fiTTAAAAAGAAGCACTTTATGTA CTGCTC 

722 TTGAAGAACy^GGTTTCTCTCTGTCCTTGACTCTTGGGTCTGTGGGCO^TGGCAT^ 

782 TTTCTAGTAGTAGATTGGAGGGAAAGCTTTGTGACACTTAGTACTGTGTTTTTAAGAAGA 

842 AATAATTTGGTTCCAGATGTGTTAGAGGATCTTTTGTACTGAGGTTTTTAACACT^ 

902 TGGGTTTACCAAGCCTCAACTGGACAGACCATAAACAGTCCACAGGCACCGTT 

962 GGCCCCAACCCACAGGGAGTCrCTCaXIAGAGCCTTCrTGGTGTTGCCCTAAC^^ 

1022 TGGCOTTTGCTCAGAGCCTCCTCCTGTGACATGTGAACAATGAAGAGGCCTGro 

1082 GCCTTGCCGCCTGCAAAGCAAAGAAAOTGCCTTTTATT^^ 

1142 GATAGTAACAAGACTGGCTGGCTGATGAGCAAAGCCrrTGCTCTC^^ 

1202 TGGATGTACAATGAAACTGCCTGGAACTAAAAGCAGTCAAGCAAGGGAGGCAAT^^ 

1262 GAAGCX3GGTCTTCCTCCAGGAACGGGGTCCCACAGGCGTGTTGTTTTAAAT 

1322 CTGTGTGCATGATGCTGGTGCTTGACCATGAAAGGAAAGTCTCATCCTTAAAATGTO^ 

1382 TACTTCACTATCCTGGACTGTTGCTTC aAGTAAA CAATATCmCATTC^ 

1442 AAAAAAAAAAAAAAT^AAAAA 
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FIGURE IIB 



Predicted promoter 

tgaaaaccc |tataa[ aggcgtcgatcggccggacaggcggc§gcggcggct 



SSH9 EXON-Intron boiindaries; 

EXONl CATGACCACAACCaccaccttcaaggga«. INTl ^tgccattatttgcagAGTTTTGCGGCCT 
EX0N2 AAATCAAGCTTCTtgggccaagtcagca™ INT2 «tattttgatttttagGTGCCAAGTCTAG 
EX0N3 CTTAGATCTGAAGgtcagtgtgacagca-. INT4 «ttttttcttttctagGGAGAAG 
EX0N4 GTGATATTCATGgtaagtacttctgaa... INT5 «.tccctgttttcatagAAAATGTGGACAC 
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1 ATGACCGACGCGCTGTTGCCCGCGGCCCCCCAGCCGCTGGAGAAGGAGAACGACGGCTAC 

6 1 TTTCGGAAGGGCTGTAATCCCCTTGCACAAACCGGCCGGAGTAAATTGCAGAATC^ 

12 1 GCTGCTTTGAATCAGCAGATCCTGAAAGCCGTGCGGATGAGGACCGGAGCGGAAAACCTT 

181 CTOAAAGTGGCCACAAACTCT^GGTGCGGGAGCAAGTGCGGCTGGAGCTGAGCTTCGT^ 

241 AACTCAGACCTOCAGATGCTCAAGGAAGAGCTGGAGGGGCTGAACATCTCGGTGG 

301 TATCAGAACACAGAGGAGGCATTTACGATTCCCCTGATTCCTCTTGGCCTGAAGGAAACG 

361 AAAGACGTCGACTTTGCAGTCGTCCTCAAGGATTTTATCCTGG^CATTACAGTG^ 

421 GGCTATTTATATGAAGATGAAArrGCAGATCTTATGGATCTGAGACAAGCTTGTCGGACG 

481 CCTAGCCGGGATGAGGCCGGGGTGGAACTGCTGATGACATACTTCATCCAGCTGGGCTTT 

541 GTCGAGAGTCGATTCTTCCCGCCCACACGGCAGATGGGACTCCTGTTCACCTGGTATGAC 

601 TCTCTCACTGGGGTTCCGGTCAGCCAGC7VGAACCTGCTGCTGGAGAAGGCCAGTGTCCT 

661 TTCAACACTGGGGCCCTCTACACCCAGATTGGGACCCGGTGCGATCGGCAGA 

721 GGGCTGGAGAGTGCCATAGATGCCTTTCAGAGAGCCGCAGGGGTTTTAAAT^ 

781 GACACATTTACCCATACTCCAAGTTACGACATGAGCCCTGCCATGCTCAGCGTGCT 

841 AAAATGATGCTTGCACAAGCCCAAGAAAGCGTGTTTGAGAAAATCAGCCTTC^ 

901 CGGAATGAATTCTTCATGCTGGTGAAGGTGGCTCAGGAGGCTCCTAAGGTC 

961 TACCAACAGCTACACGCAGCCATGAGCCAGGCGCCGGTGAAAGAGAACATCCCCTACTCC 

1021 TGGGCCAGCTTAGCCTGCGTGAAGGCCCACCACTACGCGGCCCTGGCCCACTACn~^ 

1081 GCCATCCTCCTCATCGACGA.CCAGGTGAAGCCAGGCACGGATCrrGGACC^ 

1141 TGCCTGTCCCAGCTCTACGACCACATGCCAGAGGGGCnX3ACACCCTTGGCaVC^ 

1201 AATGATCAGCAGCGCOSACAGCTGGGGAAGTCCCACTrGCGCAGAGCCATGGCTCATC^ 

1261 GAGGAGTCGGTGCGGGAGGCGAGCCTCTGCAAGAAGCTGCGGAGCATTGAGGTGCTAC^ 

1321 AAGGTGCTGTGTGCCGCACaGGAACGCTCCCGGCTCAaSTACGCCCAGCACC^ 

1381 GATGACCTGCTGAACCTGATCGACGCCCCCAGTGTTGTTGCrAAAACTGAGCTVA 

1441 GACATTATATTGCCCCAGirCTCCAAGCTGACAGTCACGGACTO 

1501 CCCTTATCTGTGTTTTCGGCTAACAAGCfeGTGGACGCCTCCTCGAAGC^ 

1561 GCAGAAQAAGGGGACTTGGGGTTCACCTTGAGAGGGAAaSCCCCCGTTCAGGra 

1621. CTGGATCCTTACTGCTCTGCCTCGGTGGCAGGAGCCCGGGAAGGAGATTATATTGTCTCC 

1681 ATTCAGCTTQTGGATTGTAAGOXSGCTGACGCTGAGTCAGGTTATGAAGCTGCTG^ 

1741 TTTGGCGAGGACGAGATCGAGATGAAAGTCGTGAGCCTCCTGGACTCCACATCATCC^ 

^ 1801 CATAATAAGAGTGCCy^CATACTCCGTGGGAATGCAGaAAACGTACTCC^ 

1861 GCCATTGATGATGACGACAAAACrGATAAAACCAAGAAAATCTCC^ 

; 1921 CTGAGTTGGGGCACCAACAAGAACAGAC^^GAAGTCAGCC^ 

1981 GTCGGGGCTGCACGGCCTCAGGTCAAGAAGAAGCTGCCCTCCCCTTTCAGCCTTCTC^ 

2041 TCAGACAGTTCTTGGTACTAATGTGAGGAAACAAACATGTTCJV 

2101 GTGCTGACTCGGCCTTAAACGTTTGTGCCATAATGGAAAATATCTATCTATCrGTTCTC^ 

2161 AATCCTGTTTTTCTCATAGTGTAAACTCACATTTGATGTGTTTTTATGAAGGAAA 

2221 CAAGAAACCTCTAGGAATTAGTGAAAAAAGAACXrrTTTGAGGTC 

2281 GTAAGTTATTTATTATATAAAGTATTGtAAATAGAATAGTGTTGAAGATATGAAATATGG 

2341 CTATTTTTAATGGTGACJ^TTATGACTTTTAGT(:a.CTATTAAATTGGGG 

2401 AGTACAATTTGTAGTTGTTTCCAGGTTTGGCTAATAATCATTCCTTAACCTA 

2461 ATGATCCTGGAATTAAGGCAGGTCAGAGGACTGTAATGATAGAATTAAATTAGTGTCACT 

2521 AAAAACTGTCCCAAAGTGCTGCTTCCTAATAGGAATTCATTAACCTAAAACAAGATGTTA 

2581 CTATTATATCGATAGACTATGAATGCTATTTCTAGAAAAAGTCTAGTGCCAAATTTGTCT 

2641 TATTAAATAAAAACAATGTAGGAGCAGCTTTTCTTCTAGTTTGATGTCTVTTTAAGAAT^ 

2701 CTAACACaVGTGGCAGTGTTAGATGAAGATGCTGTCTACAAGGTAGATAATATACTGTTTG 

2761 ATACTCAAAACATTTTTCATTTTGTa'TAAAGTAGAAGTTACATAATTCT^ 

2821 CTTGGGTAAAAAAGTAGTTTTACATTTTATAAAGTAAAGATGTAAATGATTCAGGTTTAA 

2881 AGCTCTATTTGACTTCCTTTTTTTGTTTGAGATAGCGTCTTO 

2941 AGTGCAGTGGTGTGATCTCAGCTCAGTGCAACCTCCGCCCCCTGGGATCAAGCGATTCT 

3001 CTACCrCAGCCTCCCAAATAGCTGGGACTACAAGGTGCCCTCCAGCATGC 

3061 TTTGTATTTTTAGTTGAGGTGAGGTTTCACCATGTTGGCCAGGCGGGTT^ 

3121 ACCTCAAATGATCCACCCACCTGAGCCTCCCAAAGTGCTGGGATTAC^ 

3181 CACAACCGTCCCACTATTTTACTTTTTAAAATGACATTCCTACTGATTGATl^ 

3241 GCTATAAGTTCGATGACACCGTGAATCTAATAAGGTTCACTGTTGACACAGTACAAGTTA 

3301 CATAGCTAAAATACATAGCATTGAAGACTAATTTTAAGGATTGACy^GAGTT^ 

3361 ATTGTGCAATATCTTAAAGGAAGCAACCACCTTTGGGAAAGTGTATCTGCTGCTCCTAGG 

3421 GCCATGCTTGTATACATATTTAAATAAACATATTCATTTACCCGAAAAAAAAAA^^^ 

3481 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?VAAAAAAAAAAAAAAA 
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1 MTDAIiLPAAPQPLEKENDGY 

1 ATGACCGACGCGCTGTTGCCCGCGGCCCCCCAGCCGCTGGAGAAGC5AGAACGACGGCTAC 

21 FRKGCNPLAQTGRSKLQNQR 

61 TTTCGGAAGGGCTGTAATCCCCTTGCACMACCGGCCGGA6TAAATTGCAGAATCAAAGA 

41 AALNQQILKAVRMRTGAENL 

121 GCTGCTTTGAATCAGCAGATCCTGAAAGCCGTGCGGATGAGGACCGGAGCGGAAAACCTT 

61 LKVATNSKVREQVRIiELSFV 

181 CTGAAAGTGGCCACAAACTCAAAGGTGCGGGAGCAAGTGCGGCTGGAGCTGAGCT^ 

81 NSDLQMLKEELEGLNISVGV 

241 AACTCAGACCTGCAGATGCTCAAGGAAGAGCTGGAGGGGCTGAACATCTCGGTGGGCGTC 

101 YQNTEEAFTI.PLIPLGLKET 

301 TATCAGAACACy^GAGGAGGCATTTACGATTCCCCTGATTCCrrCTTGGCCT 

X21 KDVDFAVVIiKDFILEHYSED 

361 AAAGACGTCGACTTTGCAGTCGTCCTG?^GGATTTTATCCr^^ 

141 GYLYED.EIADLMDLRQAC R T 

421 GGCTATTTATATGAAGATGAAATTGCAGATCTTATGGATCrGAGAC^ 

161 PSR.DEAGVELLMTYFIQLGF 

481 CCTAGCCGGGATGAGGCCGGGGTGGAACTGCntSATGACATACTTGAT^ 

181 VE S R F F P P TRQMG.L L F TWYD 

541 GTCGAGAGTCGATTCTTCCCGCCCACACGGCAGATGGGACTCCTGTTCACCTGGT^^ 

201 S I* T G V P V S Q Q N L Ij ^ L E K A S V L 

601 TCTCTCACTGGGGTTCCGGTCyVGCCAGCAGAACCTCCTGCT^ 

221 F NTGALYTQIGTR CDRQ TQA 

661 TTCAACACTGGGGCCCTCTACACCCAGATTGGGACCCGGTGCGATCGGCAGACGCAGGC 

241 GLESAIDAFQRAAGVLNYLK 

721 GGGCTGGAGAGTGCCATAGATGCClTTCAGAGAGCaSCAGGGGTITrAAATTACC^^ 

261 DTFTHTPSYDMSPAMXiSVLV 

781 GACACATTTACCCATACTCCAAGTTACGACATGAGCCCTGCCATGCTCy^GCGTGCTCGT^ 

281 KMMLAQAQESVFEKISLPGI 

841 AAAATGATGCTTGCACAAGCCCAAGAAAGCGTGTTTGAGAAAATCAGCCTTCCT^ 

301 RNEFFMLVKVAQEAAKVGEV 

901 CGGAATGAATTCTTCATGCTGGTGAAGGTGGCTCyVGGAGGCTGCTAAGGTGGGAGAGGTC 

321 YQQLHAAMSQAPVKENIPYS 

961 TACCAACAGCTACACGO^GCCATGAGCCAGGCGCCGGTGAAAGAGAACATCCCCTACTCC 

341 WASLACVKAHHYAALAHYFT 

1021 TGGGCCAGCTTAGCCTGCGTGAAGGCCCACCACTACGCGGCCCTGGCCCACTACTTCACT 

361 AILLIDHQVKPGTDLDHQEK 

1081 GCCATCCTCCTCATCGACCACCAGGTGAAGCCAGGCACGGATCTGGACCACCAGGAGAAG 
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381 CLSQLYDHMPEGLTPLATLK 

1141 TCCCrarcCCAGCTCTACGACCACATGCCAGAGGGGCTGACACCCTTTC 

401 NDQQRRQLGKSHLRRAMAHH 

1201 AATGATCAGCAGCGCCGACAGCTGGGGAAGTCCCACTTGCGCAGAGCCATGGCTC^ 

421 EESVREASLCKKLRSIEVLQ 

1261 GAGGAGTCGGTGCGGGAGGCGAGCCTCTGCAAGAAGCTGCGGAGCATTGAGGTGCTACAG 

441 KVLCAAQERSRLTYAQHQEE 

1321 AAGGTGCTGTGTGCCGCACAGGAACX3CTCCCGGCTCACGTACGCCCAG(:a.CC^^ 

461 DDLLNIiIDAPSVVAKTEQEV 

1381 GATGACCTOCIGAACCTGATCGACGCCCCOVGTGTTGTTGCTAAAACTGAGCAAGAOT 

481 DIXLPQFSKLTVTDFFQKIiG 

1441 GACATTATATTGCCCCAGTTCTCCAAGCTGACAGTCACGGACTTCrTCCAGAA^^ 

501 PLSVFSANKRWTPPRSIRPT 

1501 CCCTTATereTGTTTTCXSGCTAACAAGCGGTGGACGCCTCCTCGA^ 

521 AEE GDIiGFTLRGNAPVQVHF 

1561 GOlGAAGAAGGGGACTTGGGGTTCACCTTGAGAGGGAACGCCCCCGTTCAGGTrCAC^ 

541 LDPYCSASVAGAREGDYIVS 

1621 CTGGATCCTTACTGCTCTGCCTCGGTGGCAGGAGCCCGGGAAGGAGATTATATTGTCTCC 

561 I Q L V D'C K W L T L S E V M K L L K S 

1681 attcagcttgtggattctaagtggctgacgctgagtga^^ 

581 fged e ie'mkvvslldsts SM 

1741 tttggcgaggacgagatcgagatgaaagtcgtgagcctcctggactccacatcat 

601 hnksa ty s'vgmqktys m i cl 

1801 ca^taataagagtgccacatactccgtgggaatgavgaaaacgtactcavtgatctgc^ 

621 aidddd ktdktkki s kkl s f 

1861 gccattgatgatgacgacaaaactgataaaaccaagaaaatctccaagaagctt^ 

641 lswgtnknrqksastlclps 

1921 ctgagttggggcaccaacaagaacagacagaagtcagccagcaccttgtgcctccc^^ 

661 vgaarpqvkkklpspfsliin 

1981 gtcggggctccacggcctct^ggtcaagaagaagctgcccrrccccn^ 

681 S D S S W Y - 

2041 TCAGACAGTTCTTGGTACTAATGTGAGGAAACAAACATGTTCAGGCCCCGAACAT^ 
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TATA PROMOTER AND PUTATIVE TRANSCRIPTION START SITE 

AAAAAAAATAAATAAAAAGGCCGGGCGCGTTGGCCCGCGCcTGCAGCCCC 

PSL 22_5'UTR 

1 TGCTACTTGGGAGGCTGAGGCTGGAGCATCGCTTGATCCTGGGAGGTCGAGGCTGCT^ 

6 1 AGTCGAGATCGCAACACTGCTCTCCAGCCTOGGCGACAGAGCGA^ 

121 AAAAAGAACTGTGCTCAAGGACATCTGCCGTGTCTGGGGCGCAAAACCCCTCCTGGTCCC 

181 CTCTCTCAGGGCAGTCCGCGAGCCCAGCGGATCCCACTCGTCTTTGCAGCGCGGACAGGG 

241 AATCGGCTGAGTTGATCCCATGCCAACAAGCCCGAGTAGTCCGGGCAAGGCGCTCGGCGG 

301 GGCAGTCAACGCTCCCTCCGCCATGGGCTCCCCTCTTGGGAAAAGCTTTTCCy^ 

361 GGGCCCAGGGCCCAGAGCTCCCGCCGCGCCCTCGACGTGGCGTCGAGTCTGGCCCCTTCC 

421 CCCGCGGCGCACGGGCTTCACCCAGGAGGGACGCGCCTGGATCCACGCCTTCCTCACT^ 

4 8 1 CfTCCCCGGGCTCCAGGGCAGGGTGCAGGTCCACAGCCAGGGCTTCG 

541 ACCCCAGTGCCTTTCCTGCGCTCTCGCGGCACTCGCAAAGTTGAGTCAGCCACGACGCCC 

601 ACAGACAACCCCGAGGCGCCGCGCCCAGGGCQCAGCTCTCCGGGTGACGAGCGCCTCAAG 

661 GGGCGCGGGTTCGGGGCCCGCGACGGGGCGGGGOSCGTCrCCy^GGGCTCCAGTGCTCGGC 

721 CTCAGGCGGGGCTAGAAGGGCCGCGGGACGGGGTGGGAGTGGAGGGGCGGGGAAGGGCGG 

781 GGACAGGGGCGGGGCCGCACGTCCTCTCGGGCCAGCCTCAGCCGCCGCGCCTCAGTCCGC 

841 CGTCCGCCCTCCGCX3CCCGCGCCGCTAGC 

EX0N_1 69bp 

1 ATGACaSACGCGCTOTTGCCCGCGGCCCCCCAGCaSCTGQAGAAGGAGAA 
61 TTTCGGAAG 

EX0N_2 117bp 

1 GGCTGTAATCCCCTTGCACAAACCGGCCXSGAGTAAATTGCAGAATCAAAGAGCr^^ 
6 1 AATaUSCaVGATCCrGAAAGCCGTGCGGATGAGGACaSGAGOSGAA^ 

EX0N_3 129bp . 

1 GTGGCO^CAAACmZAAAGGTGCGGGAGCAAGTGCGGCTGGAGCT 
6 1 GACCTGCAGATGCTCAAGGAAGAGCTGGAGGGGCTGAACATCTCGGTGGGCGT 

121 AACACAGAG ; 

EX0N_4 75bp 

1 GAGGCATTTACGATTCCCCrrGATTCCTCTTGGCCTGAAGGAAACGAAAGACGTCGACT^ 
6 1 GCAGTCGTCCTCAAG 

EX0N_5 79bp 

1 GATTTTATCCTGGAAOVTTACAGTGAAGATGGCTATTTATATGAAGATGAAATTGCAGAT 
61 CTTATGGATCTGAGACAAG 

EX0N_6 124bp 

1 CTTQTCGGACGCCTAGCCGGGATGAGGCCGGGGTGGAACTGCTGATGACATACTTCATCC 
6 1 AGCTGGGCTTTGTOSAGAGTCGATTCTTCCCGCCCACAaSGCAGATGGGACT 
121 CCTG 

EX0N_7 167bp 

1 GTATGACTCTCTCACTGGGGTTCCGGTCAGCCAGCAGAACCTGCTGCTGGAGAAGGCCAG 
6 1 TGTCCTGTTCAACACTGGGGCCCTCTAO^CCCAGATTGGGACCCGGTGCGATCXSGa^ 
12 1 GCAGGCTGGGCTGGAGAGTGCCATAGATGCCTTTCAGAGAGCCGCAG 

EX0N_8 188bp 

1 GGGTTTTAAATTACCTGAAAGACACATTTACCaiTACTCCAAGTTACGaCATGAGCCC 
6 1 CCATGCTCAGCGTGCTCGTCAAAATGATGCTTGCACAAGCCaJ^GAAAGCGTG^^ 
121 AAATCAGCCTTCCTGGGATCCGGAATGAATTCTTCATGei^TGAAGGT^ 
181 CTGCTAAG 
EX0N_9 156bp 

1 GTGGGAGAGGTCTACCAACAGCTACACGCAGCCATGAGCCy^GGCGCCGGTGAAAGAGAAC 
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61 ATCCCCTACTCCTGGGCCAGCTTAGCCTGCGTGAAGGCCCACCACTACGCGGCCCTGGCC 

121 CACTACTTCACTGCCATCCTCCTCATCGACCACCAG 

EXON_10 120bp 

1 GTGAAGCCAGGCACGGATCTGGACCACCAGGAGAAGTGCCTGTCCCAGCTCTACGACC^^ 

6 1 ATGCCAGAGGGGCTGACACCCTTGGCCACACTGAAGAATGATCAGCAGCGCCGACA 

EX0N_11 196bp 

1 GGGAAGTCCCACTTGCGCAGAGCCATGGCTCATCACGAGGAGTCGGTGCGGGAGGCGAGC 

6 1 CTCTGCAAGAAGCTGCGGAGCATTGAGGTGCrrACAGAAGGTGCro 

121 CGCTCCCGGCTCACGTACGCCCy^GCACCAGGAGGAGGATGACCTGCTGAACCTGATC^ 

181 GCCCCCAGTGTTGTTG 

EX0N_12 77bp 

1 CTAAAACTGAGCAAGAGGTTGACATTATATTGCCCCAGTTCTCCAAGCTC 

6 1 ACTTCTTCCAGAAGCTG 

EX0N_13 147bp 

1 GGCCCCTTATCTGTGTTTTCGGCTAACAAGCGGTGGACGCCTCCTCGAAG 

6 1 ACTGCAGAAGAAGGGGACTTGGGGTTCACCTTGAGAGGGAACGCCCC^^ 

12 1 TTCCTGGATCCrTACTGCTCTGCCTCG 

EXON_14 156bp 

1 GTGGCAGGAGCCCGGGAAGGAGATTATATTGTCTCCATTCAGCITGT^^ 

61 CTGACGCTGAGTGAGGTTATGAAGCTGCTGAAGAGCTTTGGCG^ 

12 1 AAAGTCGTGAGCCTCCTGGACTCCACATCATCCATO 

EX0N_15 +3'UTR 1664bp+polyA tract 

1 CATAATAAGAGTGCCACATACTCCGTGGGAATGO^GAAAACGTACTCCATGATCTO 

6 1 GCCATTGATGATGACGACAAAACnXSATAT^CCAAGAAAATCT 

12 1 CTGAGTTQGGGCACCAACy^GAACTlGACAGAAGTCAGCCAGCACCOT 

181 GTCGGGGCTGCAOKSCerCa^GGTCAAGAAGAAGCTGCCCTCCCC^ 

241 TCAGACAGTTCTTGGTACTAATGTGAGGAAACAAACATGTTCAGGCCCOTAAC^ 

301 GTGCTGACTCGGCCTTAAACGTTTGTGCCATAATGGAAAATATCTATC^ 

361 AATCCTGTTTTTCTCATAGTGTAAACrCACATTTGATC 

421 CAAGAAACCTCTAGGAATTAGTGAAAAAAGAACTTTTTTC 

481 GTAAGTTATTTATTATATAAAGTATTGTAAATAGAATAGTGTTGAAGATATGAAATATGG 

541 CTATTTTTAATGGTGACAATTATGACTTTTAGTCACTATTAAATTGGGGT^ 

601 AGTACAATTTGTAGTTGTTTCCAGGTTTGGCTAATAATCATTCCTTAACCTAGAATTC^ 

661 ATGATCCTGGAATTAAGGCAGGTCAGAGGACTGTAATGATAGAATTAAATTAGTGTCACT 

721 AAAAACTGTCCCAAAGTGCTGCTTCCTAATAGGAATTCATTAACCTAAAACAAGATGTTA 

781 CTATTATATCGATAGACTATGAATGCTATTTCTAGAAAAAGTCTAGTGCCAAATTTC 

841 TATTAAATAAAAAOU^TGTAGGAGCAGCTTTTCTTCTAGTTTGATGTCA^^ 

901 CTAACACAGTGGCAGTGTTAGATGAAGATGCTGTCTAC^GGTAGATAATATACTGTTTC 

961 ATACTCAAAACATTTTTCATTTTGTTTAAAGTAGAAGTTACATAAOT 

102 1 CTTGGGTAAAAAAGTAGTTTTACATTTTATAAAGTAAAGATGTAAATGATTCa^ 

1081 AGCTCTATTTGACTTCCTTTTTTTGTTTGAGATAGCGTCTTGCTGTGTTC 

114 1 AGTGCT^GTGGTGTGATCTCAGCTCAGTGCAACCrCCGCCCCCTGGGATCy^ 

1201 CTACCTCAGCCTCCCAAATAGCTGGQACTACAAGGTGCCCTCCAGCATGCCTGGCTGAOT 

1261 TTTGTATTTTTAGTTGAGGTGAGGTTTCACCyVTGTTGGCCAGG 

1321 ACCTCAAATGATCCACCCACCrCAGCCTCCCAAAGTGCr^ 

1381 CACAACCGTCCCACTATTTTACTTTTTAAAATGACT^TTCCTACTGATO 

1441 GCTATAAGTTCG ATGACACCGTGAATCTAATAAGGTTCACTGTTGACACAGTACAAGT^ 

1501 CATAGCTAAAATACyiTAGCy^TTGAAGACTAATTTTAAGGATTGAC^ 

1561 ATTGTGCAATATCTTAAAGGAAGCAACCACCTTTGGGAAAGTGTA^^ 

1621 GCCATGCTTGTATACATATTTaaataaACATATTCATTTACCCGAAAAAAAAAAAAA^ 

1681 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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